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SECTION I. 

INTRODUCTORY REMARKS 


Paragraph 


Scope of this text. 1 

f Mental equipment necessary for cryptanalytic work . 2 

Validity of results of cryptanalysis. 3 


1. Scope of this text. - a. It is assumed that the student has 
studied the two preceding texts forming part of this series, viz., Special 
Text No. 165, Elementary Military Cryptography, and Special Text No. 166, 
Advanced Military Cryptography. The latter texts deal exclusively with 
cryptography as defined therein; that is, with the various types of ciphers 
and codes, their principles of construction, and their employment in 
cryptographing and decryptographing messages. Particular emphasis was 
placed upon such means and methods as are practicable for military usage. 

It is also assumed that the student has firmly in mind the technically 
precise, special nomenclature employed in those texts, for the terms and 
definitions therein will all be used in the present text, with essentially 
the same significances. If this is not the case, it is recommended that 
the student review his preceding work, in order to regain a familiarity 
with the specific meanings assigned to the terms used therein. There will 
be no opportunity herein to repeat thi6 information and unless he under¬ 
stands clearly the significance of the terms employed, his progress will 
be retarded. 

b. This text constitutes the first of a series of texts on crypt¬ 
analysis. Although most of the information contained herein is applicable 
to cryptograms of whatever type and source, special emphasis will be laid 
upon'the principles and methods of solving military cryptograms. Except 
for an introductory discussion of fundamental principles underlying the 
science of cryptanalytics, this first text in the series will deal solely 
with the principles and methods for the analysis of monalphabetic substi¬ 
tution ciphers. Even with this limitation it will be impossible to dis¬ 
cuss all the many variations of this one type, but with a firm grasp upon 
the general principles no difficultie.s should be experienced with, any 
variations that may be encountered. 

c. This and some of the succeeding texts will deal only with ele¬ 
mentary types of cipher systems not because they may bo encountered in 
military operations but because their study is essential to an understand¬ 
ing of the principles underlying the solution of the modern, very much 
more complex types of ciphers and codes that are employed by the larger 
governments today in the conduct of their military affairs in time of war. 

d. All of this series of texts will deal only with the solution of 
visible secret writing. At some future date texts doaling with the solu¬ 
tion of invisible secret writing, and with socrot signalling systoms may 
be prepared. 
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2. Mental equipment necessary for cryptanalytic work. - a. Captain 
Parker Hitt, in tho first United States Army manual 1 dealing vdth cryp¬ 
tography, opens tho first chrptor of his valuable treatise with tho follow¬ 
ing sent oneos 


"Success in dealing with unknown ciphers is measured 
by these four things in the order named, perseverance, 
careful methods of analysis, intuition, luck." 


These words are as true today as they were then. There is no royal road 
to success in the solution of cryptograms. Hitt goes on to says 

"Cipher work will have little permanent attraction 
for one who expects results at once, without labor, for 
there is a vast amount of purely routine labor in the 
preparation of frequency tables, the rearrangement of 
ciphers for examination, and the trial and fitting of 
letter to letter before the message begins to appear." 


The present author deems it advisable to add that the kind of work in- ‘ 
volved in solving cryptograms is not at all similar to that involved in 
solving "cross-word puzzles," for example. The wide vogue the latter have 
had and continue to have is due to the appeal they make to the quite com¬ 
mon instinct for mysteries of one sort or another? but in solving a cross¬ 
word puzzle there is usually no necessity for performing any preliminary 
labor, and palpable results become evident after the first minute or two 
of attention. This successful start spurs the cross-word "addict" on to 
complete the solution, v/hich rarely requires more than an hour's time. 
Furthermore, cross-word puzzles are all alike in basic principle and once 
understood, there is no more to learn. Skill cones largely from the embel¬ 
lishment of one's vocabulary, though, to bo sure, constant practice and 
exercise of the imagination contribute to the ease and rapidity with which 
solutions are genorally reacnod. In solving cryptograms, however, many 
principles must bo learned, for there are many diffo-ont systems, of vary¬ 
ing degrees of complexity. Sven somo of the simpler varieties require the 
preparation of tabulations of one sort or another, which many people find 
irksome; moreover, it is only toward the very close of the solution that 
results in the form of intelligible text become evident. Often, indeed, 
the student will not even know whether he is on the right track until he 
has performed a large amount of preliminary "spade work" involving many 
hours of labor. Thus, without at least a willingness to pursue a fair 
amount of theoretical study, and a more than average amount of patience 
and perseverance , little skill and experience can be gained in the rather 


Hitt, Capt. Parker. Manual for the Solution of Axilitary Ciphers . Army 
Sorvice Schools Press, Fort Loavonworth, Kansas, 1916. 2d Edition, 1918 
(3oth out of print) 
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difficult ait of cryptanalysis. General Givierge's remarks in this con¬ 
nection are of interest. He says^. 

"The cryptanalyst's attitude must be that of William 
the Silent: No need to hope in order to undertake, nor 
to succeed in order to persevere." 

b. As regards Hitt's reference to careful methods of analysis, be¬ 
fore one can be said to be a cryptanalyst worthy of the name it is neces¬ 
sary that one should have first a sound knowledge of the basic principles 
of cryptanalysis, and secondly a long, varied, and active practical ex¬ 
perience in the successful application of those principles. It is not 
sufficient to have read treatises on this subject. One month's actual 
practice in solution is worth a whole year's mere reading of theoretical 
principles. An exceedingly important element of success in solving the 
more intricate ciphers is the possossion of tho rather unusual mental 
faculty designated in gcnoral terms as tho power of inductive and deduc¬ 
tive reasoning. Probably this is an inherited rather than an acquired 
faculty; the best sort of training for its emergence, if latent in tho 
individual, and for its development is the study of the natural sciences, 
such as chemistry, physics, biology, goology, and tho liko. Other sciences 
such as linguistics and philology are also excellent. Aptitude in mathe¬ 
matics is quite important, more expecially in the solution of ciphers than 
of codes. 

c. An active imagination, 0 " perhaps what Hitt and other writers 
call intuition , is essential, but mere imagination uncontrolled by a 
judicious spirit will more often be a hindrance than a help. In prac¬ 
tical cryptanalysis the imaginative or intuitive faculties must, in other 
words, be guided by good judgment, by practical experience, and by as 
thorough a knowledge of the general situation or extraneous circumstances 
that led to the sending of the cryptogram as is possible to obtain. In 
this respect the many cryptograms exchanged between correspondents whose 
identities and general affairs, commercial, social, or political, are 
known are far more readily solved than are isolated cryptograms exchanged 
between unknown correspondents, dealing with unknown subjects. It is ob¬ 
vious that in tho former case thore are good data upon which the intuitive 
pov/ers of the cryptanalyst can be brought to bear, whereas in the latter 
case no such data are available. Consequently, in the absence of such 
data, no matter how good the imagination and intuition of tho cryptanalyst, 
these powers are of no particular service to him. Some writers, however, 
regard the intuitive spirit as valuable from still another viewpoint, as 
may be noted in the following; 2 

Givierge, General Marcel. Cours de Cryptographie , Paris, 1925. (P.301) 


Lange et Soudart. Traite de Cryptographie. Librairie Felix Alcan, 
Paris, 1925. 
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"Intuition, like a flash of lightning, lasts only 
for a second. It generally conies when one is tormented 
by a difficult decipherment and when one reviews in his 
mind the fruitless experiments already tried. Suddenly 
the light breaks and one finds after a few minutes what 
previous days of # labor were unable to reveal." 

This, too, is true, but unfortunately there is no way in which the intui¬ 
tion may be summoned at will, -.Then it is most needed.There are certain 

The following extracts are of interest in this connection: 

"The fact that the scientific investigator works 
50 per cont of his time by non-rational moans is, it 
seems, quite insufficiently recognized. Thero is with¬ 
out the least doubt an instinct for research, and often 
the most successful invostigators of naturo aro quito 
unable to give an account of their reasons for doing 
such and such an experiment, or for placing side by side 
two apparently unrelated facts. Again, ono of the most 
salient traits in the character of tho successful scien¬ 
tific worker is tho capacity for knowing that a point is 
proved when it would not appear to bo proved to an out¬ 
side intelligonco functioning in a purely rational man¬ 
ner; thus tho investigator foels that somo proposition 
is truo, and proceeds at onco to tho noxt sot of oxpori- 
monts without waiting and wasting time in tho elaboration 
of tho formal proof of tho point which hoavior minds would 
nood. Questionless such a scientific intuition may and 
{loos sometimos load invostigators astray, but it i6 quito 
cortain that if thoy did not widely mako use of it, they 
would not get a quartor as far as thoy do. Experiments 
confirm each other, and a false step is usually soon dis¬ 
covered. And not only by this partial replacement of 
reason by intuition does the work of science go on, but 
also to the born scientific worker - and emphatically they 
cannot be made - the structure of the method of research 
is as it were given, he cannot explain it to you, though 
he may be brought to agree a postiori to a formal logical 
presentation of the way the method works." - Excerpt from 
Needham, Joseph. "The Sceptical Biologist," page 79. 

London, 1929. 

"The essence of scientific method, quite simply, is 
to try to see how data arrange themselves into casual con¬ 
figurations. Scientific, problems are solved by collect¬ 
ing data and by 'thinking about them all the time.' We 
need to look at strange things until, by the appearance 
of known configurations, they seem familiar, and to look 
at familiar things until we see novel configurations which 
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authors who regard as indispensable the possession of a somewhat rare, 
rather mysterious faculty that they designate by the word "flair," or 
by the expression "cipher brains." Even so excellent an authority as 
General Givierge,^- in referring to this mental facility, uses the fol¬ 
lowing wordss " ... and this aptitude of mind which some authors con¬ 
sider a special gift, and which they call intuition, or even, in its 
highest manifestation, clairvoyance ... although the present author 
believes a special .aptitude for the work is essential to cryptanalytic 
success, he is sure thorc is nothing mysterious about the matter at all. 
Special aptitude is prerequisite to success in all fields of endeavor. 
Thoro are, for oxamplo, thousands of physicists, hundreds of excellent 
ones, but only a handful of world-wide fnrao. Should it bo said, then, 
that a physicist who has achieved very notable succoss in his field has 
dono so bocauso ho is tho fortunate possessor of a mysterious faculty? 
That he is fortunate in possessing a spocial aptitude for his subject is 
granted, but that thoro is anything mysterious about it, partaking of 
tho nature of clairvoyance (if, indoed, the lattor is a roality ) is not 
granted. Vtfhila tho ultimate nature of any mental procoss seems to bo 
as complete a mystery today as it has over boon, tho present author 
’.vould like to soo tho superficial voil of mystory removed from a sub¬ 
ject that has bean shrouded in mystery from even before the Middle Ages 
down to our own times. (The principal and easily understandable reason 
for this is that governments have always closely guarded cryptographic 
secrets and anything so guarded soon becomes "mysterious.") He would, 
rather, have the student approach the subject as he might approach any 
other science that can stand on its own merits with other sciences, be¬ 
cause cryptanalytics, like other sciences, has a practical importance in 
human affairs. It presents to the inquiring mind an interest in its own 


make them appear strange. ’Ve must look at events until 
they become luminous. That is scientific method ... . 
Insight is the touchstone ... . The application of in¬ 
sight as the touchstone of method enables us to evaluate 
properly the role of imagination in scientific method. 

Tho scientific process is akin to the artistic process, 
it is a process of selecting out those elements of ex¬ 
perience which fit together and recombining them in the 
mind. Much of this kind of research is simply a cease¬ 
less mulling over, and even the physical scientist, has 
considerable need of an armchair." "Our view of scien¬ 
tific method as a struggle to obtain insight forces the 
admission that science is half art." "Insight is the 
unknown quantity which has eluded students of scientific 
method." - Excerpts from an article entitled "Insight and 
Scientific Method" by Willard Waller, in The American 
Journal of Sociology . Vol. XL, 1934. 


1 


Loc. cit., p. 302 
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right as a branch of knowledge; it, too, holds forth many difficulties and 
disappointments, and these are all the more keenly felt when the nature of 
these difficulties is not understood by those unfamiliar with the special 
circumstances that very often are the real factors that led to success in 
other cases. Finally, just as in the other sciences wherein many men labor 
long and earnestly for the true satisfaction and pleasure that comes from 
work well-done, so the mental pleasure that the successful cryptanalyst 
derives from his accomplishments is very often the only reward for much of 
the drudgery that he must do in his daily work. Givierge's words in this 
connection are well worth quoting. Ho says (p. 301); 

"Some studies will last for years before bearing 
fruit. In the case of others, cryptanalysts undertaking 
thorn never got any result. But, for a cryptanalyst who 
likes the work, the joy of discoveries effaces the memory 
of his hours of doubt and impatience." 

d. With his usual deft touch, Hitt says of the elemont of luck, as 
regards the role it plays in analysis: 

"As to luck, there is the old miners' proverb 'Gold 
is v/here you find it'." 

Tho cryptanalyst is lucky whon one of the corrospondonts whose ciphers ho 
is studying makes a blunder that givos the necessary clue; or whon ho finds 
two cryptograms identical in toxt but in difforont koys in tho same system; 
or whan ho finds two cryptograms identical in toxt but in different systems, 
and 30 on. Th3 element of luck is thore, to be sure, but tho cryptanalyst 
must be on tho alort if he is to profit by these lucky "breaks." 

e. If the present author were asked to state, in view of the progress 
in the field since 1916, what elements might be added to the four ingredi¬ 
ents Hitt thought essential to cryptanalytic success, he would be inclined 
to mention the following: 

(1) A broad, general education, embodying interests 
covering as many fields of practical knowledge as possible. 

This is useful because the cryptanalyst is often called 
upon to solve messages dealing with the most varied of 
human activities, and the more he knows about these ac¬ 
tivities, the easier his task. 

(2) Access to a large library of current literature 
and wide and direct contacts with sources of collateral 
information. These often afford clues as to the contents 
of specific messages. For example, to be able instantly 
to have at his disposal a newspaper report or a personal 
report of events described or referred to in a message 
under investigation goes a long way toward simplifying or 
facilitating solution. Government cryptanalysts are some¬ 
times fortunately situated in this respect, especially 
whore various agencies work in harmony. 


REF ID:A64644 


- 7 - 

(3) Proper coordination of effort. This includes 
the organization of cryptanalytic personnel into har¬ 
monious, efficient teams of cooperating individuals. 

(4) Under mental equipment he would also include 
the faculty of being able to concentrate on a problem 
for rather long periods of time, without distraction, 
nervous irritability, and impatience. The strain under 
which cryptanalytic studies are necessarily conducted 
is quite severe and too long-continued application has 
the effect of draining nervous energy to an unwholesome 
degree, so that a word or two of caution may not hero 
be out of place. One should continue at work only so 
long as a peaceful, calm spirit prevails, whether the 
work is fruitful or not. But just as soon as the mind 
becomes wearied with the exertion, or just as soon as a 
fooling of hopelessness or mental fatigue intervenes, 
it is better to stop completely and turn to other ac¬ 
tivities, rost, or play. It is ossontial to remark 
that systematization and orderliness of work are aids 
in reducing nervous tension and irritability. On this 
account it is better to take the time to prepare the 
data carefully, rewrite the text if necessary, and so 
on, rather than work with slipshod, incomplete, or im¬ 
properly arranged material. 

(5) A retentive memory is an important asset to 
cryptanalytic skill, especially in the solution of codes. 

The ability to remember individual groups, their ap¬ 
proximate locations in other messages, the associations 
they form with other groups, their peculiarities and 
similarities saves much wear and tear of the mental 
machinery, as well as much time in looking up these 
groups in indexes, 

f. It may be advisable to add a word or two at this point to prepare 
the student to expect slight mental jarrs and tensions which will almost 
inevitably come to him in the conscientious study of this and the subse¬ 
quent texts. The present author is well aware of the complaint of students 
that authors of texts on cryptanalysis baso much of their explanation upon 
their fore-knowledge of the "answer”- which the student does not know while 
he is attempting to follow the solution with an unbiased mind. They com¬ 
plain too that those authors use such oxprossions as "obviously", "natural¬ 
ly", "of course", "It is evident that", and so on, when the circumstances 
seem not at all to warrant their use. There is no question but that this 
sort of troatmont is apt to discourage the studont, especially whon the 
point elucidated becomes clear to him only after many hours labor, whereas, 
according to the book, the author noted the weak spot'at the first moment's 
inspection. The present author can only promise to try to avoid making the 
steps appear to be much more simple than they really are, and to suppress 
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glaring instances of unjustifiable '•jumping at conclusions". At the same 
time he must indicate that for pedagogical reasons in many cases a message 
has been consciously "manipulated" so as to allow certain principles to 
become more obvious in the illustrative examples than they ever are in 
practical work. During the course of some of the explanations attention 
will even be directed to cases of unjustified inferences. Furthermore, of 
the student who is quick in observation and deduction, the author will only 
ask that he bear in mind that if the elucidation of certain principles 
seems prolix and occupies more space than necessary, this is occasioned by 
the author's desire to carry tho explanation forward in very short, oasily- 
comprohonded, and plainly-doscribod stops, for the bonofit of students who 
arj perhaps a bit slower to grasp but who, onco they understand, aro able 
to retain and apply principles slowly learned just as woll, if not better 
than tho students who loarn moro quickly. 

3. Validity of results of cryptanalysis. - Valid, or authentic crypt- 
analytic solutions cannot and do not represent "opinions" of tho crypt¬ 
analyst. They are valid only so far as they are wholly objective, and are 
susceptible of demonstration and proof, employing authentic, objective 
methods. It should hardly be necessary (but an attitude frequently en¬ 
countered among laymen makes it advisable) to indicate that the validity 
of the results achieved by any serious cryptanalytic studies on authentic 
material rests upon the same sure foundations and are reached by the same 
general steps as the results achieved by any other scientific studies; viz., 
observation, hypothesis, deduction and induction, and confirmatory experi¬ 
ment. Implied in the latter is the possibility that two or more qualified 
investigators, each working independently upon the same material, will 
achieve identical (or practically identical) results. Occasionally a 
pseudo-cryptanalyst offers "solutions" which cannot withstand such tests; 
a second, unbiased, investigator working independently either cannot con¬ 
sistently apply the methods alleged to have been applied by the pseudo¬ 
cryptanalyst, or else, if he can apply them at all, the results (plain¬ 
text translations) are far different in the two cases. The reason for 
this is that in such cases it is generally found that the "methods" are 
not clear-cut, straight forward or mathematical in character. Instead, 
they often involve the making of judgments on matters too tenuous to 
measure, weigh, or otherwise subject to careful scrutiny. In such cases, 
the conclusion to which the unprejudiced observer is forced to come is 
that the alleged "solution" obtained by the first investigator, the pseudo¬ 
cryptanalyst, is purely subjective. In nearly all cases where this has 
happened (and they occur from time to time) there has been uncovered 
nothing which can in any way be used to impugn the integrity of the 
pseudo-cryptanalyst. The worst that can bo said of him is that he has 
become a victim of a special or peculiar form of self-delusion, and that 
his desire to solve tho problem, usually in accord with some previously- 
formed opinion, or notion, has over-balanced, or undermined, his judgment 
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^ Specific reference can be made to the following typical "case histo¬ 
ries"; 

Donnelly, Ignatius, The Great Cryptogram . Chicago, 188B. 

Owen, Orville W., Sir Francis Bacon's Cipher Story . Detroit, 18S5. 

Callup, Elizabeth Wells, Francis Bacon’s Biliteral Cipher . Detroit, 1900. 
Margoliouth, D. S., The Homer of Aristotle . Oxford, 1923. 

Newbold, William Romaine, The Cinher of Roger Bacon . Philadelphia, 1928. 
(For a scholarly and complete demolition of Profossor Kewbold's 
work, see an article entitled " Roger Bacon and the Voynich MS ", by 
John M. Manly, in Speculum , Vol. VI, No. 3, July 193lTT 
Arensberg, '/alter Conrad, The Cryptog r aphy of Shakespeare . Los Angeles, 
1922. 

The Shakespearean Mystery . Pittsburgh, 1928. 

The Baconian Keys . Pittsburgh, 1928. 

Feely, Joseph Martin, The Shakespearean Cypher . Rochester, N. Y., 1931. 
Deciphering Shakespeare . Rochester, N. Y., 1934. 
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SECTION II 

FUHJAlAENTAL PRINCIPLES 


Paragraph 


The four basic operations in cryptanalysis.. 4 

The determination of the language employed. 5 

The determination of the general system . ... 6 

The reconstruction of the specific key.. 7 

The reconstruction of the plain text. 8 


4. The four basic operations in cryptanalysis. - a. The solution 
of practically every cryptogram involves four fundamental operations or 
steps; 

(1) The determination of the language employed in 
the plain-text version. 

(2) The determination of the general system of 
cryptography employed. 

(3) The reconstruction of the specific key in the 
case of a cipher system, or the reconstruction, partial 
or complete, of the code book, in the case of a code 
system; or both, in the case of an enciphered code system. 

(4) The reconstruction or establishment of the plain 

text. 

b. These operations will be taken up in the order in which they are 
given above and in which they usually are performed in the solution of 
cryptograms, although occasionally tho second stop may precede the first. 

5. The determination of tho language employed. - a. There is not 
much that need be said with respect to this operation except that the de¬ 
termination of the language employed seldom comes into question in the 
case of studies made of the cryptograms of an organized enemy. By this is 
meant that during war time the enemy is of course known, and it follows, 
therefore, that the language he employs in his messages will almost cer¬ 
tainly be his native or mother tongue. Only occasionally nowadays is this 
rule broken. Formerly it often happened, or it might have indeed been the 
general rule, that the language used in diplomatic correspondence was not 
the mother tongue, but French. In isolated instances during tho World War, 
tho Gormans usod English when their own language could for ono roason or 
another not be employed. For example, for a yoar or two bofore the ontry 
of tho United States into that war, during tho time araorica was neutral 

and the Gorman Government maintained its embassy in Washington, the mossagos 
exchanged between tho Foroign Office in Berlin and tho Embassy in Washing¬ 
ton wore cryptographed in English, and a copy of tho code usod was deposited 
with the Department of State and our censor. Another instance is found in 
the case of certain Hindu conspirators who were associated with and partially 
financed by the German Government in 1915 and 1916; they employed English as 
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the language of their cryptographic messages. Occasionally the crypto¬ 
grams of enemy agents may be in a language different from that of the 
enemy. But in general these ere, as has been said, isolated instances; 
as a rule, the language used in cryptograms exchanged beween members of 
large organisations is the mother tongue of the correspondents. lJ here 
this ia not the case, that is, when cryptograms of unknown origin must 
be studied, the cryptanalyst looks for any indications on the cryptograms 
themselves which may lead to a conclusion as to the language employed. 
Address, signature, and plain-language words in the preamble or in the 
body of the text all come under careful scrutiny, as '-fell as all extran¬ 
eous circumstances connected ” r ith the manner in which the cryptograms 
were obtained, the person on whom they were found, or the locale of their 
origin and destination. 

b. In special cases, or under special circumstances a clue to the 
language employed is found in the nature and composition of the crypto¬ 
graphic text itself. For example, if the letters K and W are entirely 
absent or appear very rarely in massages, it may indicate that the language 
is Spanish, for these letters arc absent in the alphabet of that language 
and are used only to spell foreign words or names. Thr presence of ac¬ 
cented letters or letters marked with special signs of one sort or another, 
peculiar to certain languages, will sometimes indicate the language used. 
The Japanese tlorse telegraph alphabet and the Russian horse telegraph 
alphabet contain combinations of dots and dashes which are peculiar to 
those alphabets and thus the interception of messages containing these 
special lw orse combinations at once indicates the language involved. 

Finally, there are certain peculiarities of alphabetic 1’nguagos which, 

in certain typos of cryptograms (pure transposition), give clues as to 
the language used. For example, the freouent digraph CH, in German, leads 
to the presence, in cryptograms of the typo mentioned, of many isolated 
C's and H’a; if this is noted, tho cryptogram may be assumed to bo in 
Gorman. 

c. In some cases it is perfectly possible to perform certain steps 
in cryptanalysis before the language of tho cryptogram has been definitely 
determined. Frequency studies, for example, may be made and alalytic 
processes performed without this knowledge, and by a cryptanalyst wholly 
unfamiliar with the language even if it has b'en identified, or who knows 
only enough about the language to enable him to recognize Valid combina¬ 
tions of letters, syllables, or o few common words in that language. He 
may, after this, call to his assistance a translator who may not be a 
cryptanalyst but who can materially aid in making necessary assumptions 
based upon his special knowledge of the characterise ice of the language 

in question, ^hus, cooperation between cryptanalyst and translator 
results in solution. 

I The writer has seen in print statements that "during tho Uorld ’Var .... 
decoded messages in Japanese and Russian without knowing a word of either 
language." H e has even heard alleged cryptanalysts make such fantastic 
claims. But to say that ib is possible to solve a cryptogram in a foreign 
language "without knowing a word of that language" is quite o different 
thing from saying that it is possible to do so with only p slight knowledge 
of the language. Tho absurdity and fdnount of exaggeration contained in the 
former statement will soon become obvious to the student. It may be stated 
without cavil that the better the cryptanalyst's knowledge of the language, 
the easier is his work. 
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6 . The determination of the general system. - a. Except in the case 
of the more simple types of cryptograms, the determination of the general 
system according to which a given cryptogram has been produced is usually 
a difficult, if not the most difficult, step in its solution. The reason 
for this is not hard to find. 

b. As will become apparent to the student as he proceeds with his 
study, in the final analysis, the solution of every cryptogram involving 

a form of substitution depends upon its reduction to monoalphabetic terms, 

if it is not originally in those terms . This is true not only of ordinary 
substitution ciphers, but also of combined substitution-transposition 
ciphers, and of enciphered code. If the cryptogram must be reduced to 
monoalphabetic terms, the manner of its accomplishment is either indicated 
by the cryptogram itself, by external or internal phenomena which become 
apparent to the cryptanalyst as he studies the cryptogram. If this is im¬ 
possible, or too difficult the cryptanalyst must, by one means or another, 
discover how to accomplish this reduction by bringing to bear all the 
special or collateral information he can get from all the sources at his 
command. If both these possibilities fail him, there is little left but 
the long, tedious, and often fruitless process of elimination. In the 
case of transposition ciphers of the more complex type, the discovery of 
the basic method is often simply a matter of long and tedious elimination 
of possibilities. For cryptanalysis has unfortunately not yet attained 
and may indeed never attain the precision found today in qualitative 
analysis in chemistry, for example, where the analytic process is absolutely 
clear cut and exact in its dichotomy. A few words in explanation of what 
is meant may not be amiss. TiJhen a chemist 6eeks to determine the identity 
of an unknown substance, he applies certain specific reagents to the sub¬ 
stance and in a specific sequence. The first reagent tolls him definitely 
into which of two primary classes the unknown substance falls, say class 
A. He then applies a second tost with another specific reagent, which 
tells him again quite definitely into which of two secondary classes the 
unknown substance falls, and so on, until finally he has reduced the un¬ 
known substance to its simplest terms and has found out what it is. In 
striking contrast to this situation, cryptanalysis affords exceedingly few 
"reagents" or tests that .nay bo applied to determine positively that a 
given cipher belongs to om or the other of two systems yielding externally 
similar results. And this is what makes tho analysis of an isolated, com¬ 
plex cryptogram so difficult. Note the limiting adjective "isolated" in 
tho forogoing sontonco, for it is used advisedly. It is not often that tho 
genoral system fails to disclose itself or cannot bo discovered by painstak¬ 
ing investigation when thoro is a groat volume of text accumulating from a 
regular traffic between numerous correspondents in a large organization. 
Sooner or later the system becomes known, either as the result of blunders 
and carelessness on the part of the personnel entrusted with the cryptograph¬ 
ing of the messages, or the accumulation of text itself makes possible the 
determination of the general system by cryptanalytic studies. But in the 
case of a single or even a few isolated cryptograms concerning which little 
or no information’can be gained by the cryptanalyst, he is often unable, 
without a knowledge of, or a shrewd guess as to the general system employed, 


to decompose the heterogeneous text of the cryptogram into homogeneous, 
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raonoalphabetic text, which is Lhe ultimate and essential step in analysis. 
The only knowledge that the cryptanalyst can bring to his aid in this most 
difficult step is that gained by long experience and practice in the analy¬ 
sis of many different types of systems. 

c. On account of the complexities surrounding this particular phase 
of cryptanalysis, and because in any scheme of analysis based upon succes¬ 
sive eliminations of alternatives the analyst can only progress as far as 
the extent of his own knowledge of all the possible alternatives will per¬ 
mit, it is necessary that detailod discussion of the eliminative process 

be postponed until the student has covorod most of tne field. For example, 
the student will perhaps want to know at once how ho can distinguish be¬ 
tween a cryptogram that is in code or enciphered code from one that is in 
cipher. It is at this stage of his studies impracticable to give him any 
helpful indications on his question. In return it may bo askod of him 
why he should expect to bo able to do this in the early stagos of his 
studies when often tho oxporionced oxport cryptanalyst is baffled on the 
same scoro. 

d. Hovertholoss, in lieu of moro procise tests not yet discovered, 

a general guide that may be useful in cryptanalysis will bo built up, stop 
by stop as tho student progrossos, in the form of a series of charts com¬ 
prising what may bo designated " An Analytical Key For Cryptanalysis 11 (See 
Par. 50.) It may be of assistance to the student if, as he proceeds, he 
will carefully study the charts and note the place which the particular 
cipher he is solving occupies in the general cryptanalytic panorama. They 
admittedly constitute only very brief outlines, and can therefore be of but 
little direct assistance to him in the analysis of the more complex types 
of ciphers he may encounter later on. So far as they go, however, they 
may be found to be quite useful in the study of elementary cryptanalysis. 

For the experienced cryptanalyst they can sorve only as a means of assur¬ 
ing that no possible step or process is inadvertently overlooked in attempts 
to solve a difficult cipher. 

e. Much of the labor involved in cryptanalytic work, as referred to 
in Par. 2, is connected with this determination of the general system. The 
preparation of the text, its rewriting in difforent forms, sometimes being 
rewritten in a half dozen ways, tho recording of letters, the establish¬ 
ment of frequencies of occurrences of letters, comparisons and experiments 
made with known material of similar character, and so on, constitute much 
labor that is most often indispensable, but v/hich sometimes turns out to 
have been wholly unnecessary, or in vain. In a rocent treatise'*' it is 
stated quite boldly that "this work onco done, the determination of tho 
system is often relatively easy." This statement can certainly apply only 
to tho simpler typos of ciphers; it is ontiroly misleading as regards the 
much more frequently oncountorod complex cryptograms of modern times. 

Lange et Soudart, already cited (page 106). 
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7. Tho reconstruction of the spocific koy. - a. Nearly all practical 
cryptographic methods require the use of a specific key to guide, control 
or modify the various steps under the general system. Once the latter has 
been disclosed, discovered, or has otherwise come into tho possession of 
the cryptanalyst, tho next step in solution is to determine, if necessary, 
and if possible, the spocific key that was employed to cryptograph the 
message or raossages under examination. This determination may not be in 
complete detail; it may go only so far as to load to a knowledge of tho 
number of alphabets involved in a substitution ciphor, or tho numbor of 
columns involved in a transposition ciphor, or that a ono-part code has 
been used, in fcho caso of a codo system. But it is ofton desirable to de- 
toraino tho spocific koy in as comploto a form and with ns much dotail as 
possible, for this information will very froquontly bo usoful in tho solu¬ 
tion of subsoquont cryptograms exchanged botwoon tho samo correspondents, 
since tho nature of tho spocific koy in a solved casu may bo expected to 
givo clues to tho specific koy in an unsolved case. 

b. Frequently, however, the reconstruction of the key is not a pre¬ 
requisite to, and does not constitute an absolutely necessary preliminary 
step in the fourth basic operation, the reconstruction or establishment of 
the plain text. In many cases, indeed, the two processes are carried along 
simultaneously, the one assisting the other, until in the final stages both 
have been completed in their entireties. In still other cases the recon¬ 
struction of the specific key may succeed instead of precede the reconstruc¬ 
tion of the plain text, and is accomplished purely as a matter of academic 
interest; or the specific key may, in unusual cases, never be reconstructed. 

3. The reconstruction of the plain text. - §.. Little need be said 
at this point on this phase of cryptanalysis. The process usually con¬ 
sists, in the case of substitution ciphers, in the establishment of equiva¬ 
lency between specific letters of the cipher text and the plain text, letter 
by letter, pair by pair, and so on, depending upon the particular type of 
substitution system involved. In the case of transposition ciphers, the 
process consists in rearranging the elements of the cipher text, letter by 
letter, pair by pair, or occasionally word by word, depending upon the par¬ 
ticular typo of transposition system involved, until the letters have been 
returned to their original plain-text order. In the case of code, the pro¬ 
cess consists in determining the meaning of each code group and inserting 
this moaning in the code text to reestablish the original plain text. 

b. The foregoing processes do not, as a rule, begin at the beginning 
of a message and continue letter by letter, or group by group in sequence 
up to the very end of the message. The establisnment of values of cipher 
letters in substitution methods, or of the positions to which cipher let¬ 
ters should be transferred to form the plain text in the case of transposi¬ 
tion methods, comes at very irregular intervals in the process. At first 
only one or two values scattered here and there throughout the text may ap¬ 
pear; these then form the "skeletons” of words, upon which further work, 
by a continuation of the reconstruction process, is made possible; in the 
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end the complete or nearly complete^ toxt is established. 

In the case of cryptograms in a foreign language, the transla¬ 
tion of the solved messages is a final and necessary step, but is not to 
be considered as a cryptanalytic process. However, it is commonly the 
case that the translation process will be carried on simultaneously with 
the cryptanalytic, and will aid the latter, especially when there are 
lacunae which may be filled in from the context. (Soo also Par. 5 c in 
this connection.) 


Sometimes in the caso of code, tho meaning of a few code groups may be 
lacking, because thore is insufficient toxt to establish their meaning. 
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FRE'UUIICY DIST-II3UTI01IS 


Paragraph 


The simple or monoliteral-frequency distribution . 9 

Important features of the normal monoliteral frequency, bar- 

distribution.. . -.10 

Constancy of the standard or normal monoliteral-frequency 

distribution.11 


9. The simple or monoliteral-frequency distribution. - a. It has 
long been known to cryptographers and typographers that the letters com¬ 
posing the words of any intelligible written text composed in any language 
which is alphabetic in construction are employed with greatly varying fre¬ 
quencies. For example, if on cross-section paper a simple graph, shown in 
Fig. 1, called a monoliteral frequency, bar-distribution , is made of the 
letters comprising the words of the preceding sentence, the variation in 
frequency is strikingly demonstrated. It is seen that whereas certain 
letters, such as a, IJ, L, H, 0, R, S, and T, are employed very frequently, 
other letters, such as C, 9, P, and are employed not nearly so frequently, 
while still other letters, such as F, -J, '2, V, and Z are employed either 
seldom or not at all. 



A3CD3FGHIJkLLiN0r'2R&TUV / X l 2 
14 3 8 4 22 2 9 10 15 0 1 9 3 IT 14 8 1 13 10 20 3 1 5 1 7 0 

( Total. 200 letters) 

Fig. 1 

b. If a similar graph is now made of the letters comprising the v/ords 
of the second sentence in the preceding paragraph, the graph shown in Fig. 2 
is obtained. Both sentences have exactly the same number of letters (200). 



A 3 0 J 2 F G H I J K L ii 11 0 P Q R S T U V './ X Y Z 

12 2 8 7 25 7 5 5 19 0 1 8 6 16 14 7 2 16 14 15 3 1 2 1 3 0 

Fig. 2 
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c. Although each of these two graphs exhibits great variation in the 
relative frequencies with which different ‘letters are employed in the sen¬ 
tences to which they apply, no marked differences are exhibited between 
the frequencies of the same letter in the two graphs. Compare, for ex¬ 
ample, the frequencies of B, C, ... in Fig. 1 with those of A, B, C, 

111 in Fig. 2. Aside from one or two exceptions, as in the case of the 
letter F or the letter W, these two graphs agree rather strikingly. 

d. This agreement, or similarity , would be practically complete if 
the two texts were much longer, for example, five times as long. In fact, 
when two texts of similar character, each containing more than 1000 let¬ 
ters, are compared, it would be found that the respective frequencies of 
the 26 letters composing the two graphs show only very slight differences. 
This means, in other words, that in normal text each letter of the alpha¬ 
bet occurs with a rather constan t or characteristic frequency which it 
fends to approximate, depending upon the length of the text analyzed. 

the longer the text (within certain limits), the closer will be the ap¬ 
proximation. 1 

e. An experiment along these lines will be convincing. A series of 
260 official telegrams 2 passing through the '-'ar Department i.essage Center 
was examined statistically. The messages were divided into five sets, 
each totaling approximately 10,000 letters, and the five distributions 
shown in Table I were obtained. 

f. If the five distributions in Table 1 are summed, the results are 
as shown in Table 2. 

g. The frequencies noted in subparagraph f, when reduced to the basis 
of 1,000 letters and then used as a basis for constructing a simple chart 
that vail exhibit the variations in frequency in a striking manner, yield 
the following graph which is hereafter designated as the normal » or stand ¬ 
ard inonoliteral frequency , bar-distribution for English telegraphic plain 
text: 


1 


See footnote 1 to page 23. 


2 These comprised messages from several departments in addition to the 
^/ar Department, and we re all of an administrative character. 



WjrHlhHJlWWIfHlItlJIHJWJWrW WWWJfrUfWItiirtijrHJfHirtWfHJfHJtWfHJfW 

IIIHItHlfHlIHllHilHIIHlINJ 
• IIHIIHitWIl'tlllNJltU 
iwwi 

liwtwtwrwtwJTOiwwiitutHitwiwtHj 
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ABCDBFOHIJKLUHOPQRSTUVtfXrZ 
74 10 31 42130 28 16 34 74 2 3 36 25 79 75 27 3 76 61 92 26 15 16 5 19 1 

Fig. 3 

10. Impo-tant features of the noraal, monoliteral-frequency, bar- 
distribution. - a. YThen the graph shown in Fig. 3 is studied in detail, 
the following features are apparent > 

(1) It is quite irregular in appearance. This 
is because the letters are used with greatly varying 
frequencies, as discussed in the preceding paragraph. 

This irregular appearance is often described by say¬ 
ing that the graph shows marked crests and troughs , 
that is, points of high frequency and low frequency. 


Ill It+UrtUtfil 



REF ID*cA.64 644 


table'1-A 


Absolute frequencies of lott-.rs appearing in five sets of Govern¬ 
mental plain-text tolag’v.ms, Q^ch set containing 10,000 letters. 

Arranged alphabetically. 


Message 
No. 1 


Message 
No. 2 


Message 
No. 3 


Message 
No. 4 


Message 
No. 5 



© q 

© O 


-2 1 

S3 3 3 

U 2 a) 

s If 

» 5 3 

& 3 | 

+> o a* 

+s oo 1 

-P o &• 



© ,Q £ 

t-3 «! P=t 

® ,q u 

1-3 << Ph 

© & h 

^ <t |i| 


© P h 

t-3 < 

A - 738 

A - 785 

A - 681 

A - 740 

A - 741 

B - 104 

B - 103 

B - 98 

B - 33 

3-99 

C - 319 

G - 300 

C - 298 

C - 326 

C - 301 

D - 387 

D - 413 

D - 423 

D - 451 

D - 448 

E -1367 

E -1294 

E -1292 

E -1270 

E -1275 

F - 253 . ■ 

F - 287 

F - 308 

F - 287 

F - 281 

G - 166 

G - 175 

G - 161 

G - 167 

G - 150 

H - 310 

H - 351 

H - 335 

H - 349 

H - 349 

I - 712 

I - 750 

I - 737 

I - 700 

I - 297 

J - 18 

J - 17 

J - 10 

J - 21 

J - 16 

K - 36 

E - 38 

K - 22 

K - 21 

X - 31 

L - 365 

L - 393 

L - 533 

L - 38G 

L - 344 

M - 242 

M - 240 

la - 238 

Lf - 249 

M - 268 

N - 786 

N - 794 

N - 815 

N - 800 

N - 780 

0-685 • 

0 - 770 

0 - 791 

0 - 756 

0 - 762 

P - 241 

P - 2*72 

P - 317 

P - 245 

P - 260 

Q - 40 

G - 22 

C - 45 

Q - 38 

Q - 30 

R - 760 

R - 745 

R - 762 

R - 735 

R - 786 

S - 658 

S - 503 

S - 585 

S - 628 

S - 604 

T - 936 

T - 879 

T - 894 

T - 958 

T - 928 

U - 270 

U - 233 

U - 312 

U - 247 

U - 238 

V - 163 

V - 17? 

V - 142 

V - 133 

V - 155 

W - 166 

W - 163 ‘ 

7. - 136 

7/ - 133 

7. r - 182 

X - 43 

X - 50 

X - U 

X - 53 

X - 41 

Y - 191 

Y - 155 

Y - 179 

Y - 213 

Y - 229 

Z - 14 

Z - 17 

Z - 2 

Z - 11 

Z - 5 


Totals 

10,000 10,000 10,000 10,000 10,000 


Table 2-A 


Absblute frequencies of letters appearing in the combined five sets 
of messages totalling 50,000 letters arr 1 nged alphabetically. 


A - 3683 

G - 819 

L - 1821 

Q - 175 

V - 

766 

B - 48? 

H - 1694 

M - 1237 

R - 3788 

W - 

780 

C - 1534 

I - 3676 

N - 5975 

S - 3058 

X - 

231 

D - 2i22 

J - 82 

0 - 3764 

T - 4595 

Y - 

967 

H - '6498' 

K - 148 

P - 1335 

U - 1300 

Z - 

49 


F. - 1416 
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(2) The relative positions in which the crests and 
troughs fall with the graph, that is, the spatial relations 

of the crests and troughs, are rather definitely fixed and ■ 
are determined by circumstances which have been explained 
in a preceding text. 1 

( 3 ) The relative heights and depths of the crests and 
troughs within the graph, that is, the linear extensions of 
the lines marking the respective frequencies, are also rather 
definitely fixed, as would be found if an equal volume of 
similar text were analyzed. 

(4) The most prominent crests are marked by the vowels 
A, A, I, 0, and the consonants N, R, S, T; the most prominent 
troughs are marked by the consonants I, II, X, and Z. 


(5) The important data are summarized in tabular form 


TABLE 3 

6 Vowels. A3I0UY. . . 

Frequency 
. 398 

'/> of 
Total 

39.8 

/o of Total in 
Round Numbers 

40 


/ *>\, > ■“ 

[ 5 High Frequency . . . 

. 350 

35.0 

35 

20 Consonants^ 

(D H A S T) 

10 medium Frequency. . . 

. 238 

23.8 

24 

1 

| (3 C F G H L 11 j? V ’/) 

5 Low Frequency. . . . 

. 14 

1.4 

1 


' (J K Q X Z) 





Total. 

1000 

100.0 

100 


( 6 ) 

The 

frequencies of the letters 

of the alphabet 

are as 

follows. 






A - 74 


G - 16 

L - 36 

i - 3 

V - 15 

3-10 


H - 34 

li - 25 

R - 76 

U - 16 

C - 31 


1-74 

N - 79 

S - 61 

X - 5 

0-42 


J - 2 

0-75 

T - 92 

Y - 19 

2 -130 


il - 3 

F - 27 

U - 26 

Z - 1 

F - 28 






(7) 

The 

relative order 

of frequency of the letters 

is as 

follows. 






3 -130 


1-74 

C - 31 

Y - 19 

X - 5 

T - 92 


S - 61 

F - 28 

G - 16 

'< - 3 

H - 79 


0-42 

F - 27 

/ - 16 

K - 3 

A - 76 


L - 36 

U - 26 

V - 15 

J - .2 

0-75 


H - 34 

u - 25 

3-10 

Z - 1 

A - 74 






( 8 ) 

The 

four vowels A, 

A, I, 0 (combined frequency 

353) and 

the four 

consonants N, R, S 

, 'f (combined frequency 308) 

form 661 

out of every 

1,(700 letters 

of plain text 

5 in other words, less 


than l/3 of the Alphabet is employed in writing 2/3 of normal plain 
text. 

1 Section VII of Special Text No. 165, Elementary military Cryptography. 
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b. The data given in Fig. 3 and Table 3 represent the relative fre¬ 
quencies found in a large volume of English telegraphic text of a govern¬ 
mental, administrative character. These frequencies will vary somewhat 
with the nature of the text analyzed. For example, if an equal number of 
telegrams dealing solely with commercial transactions in the leather in¬ 
dustry were studied statistically, the frequencies would be slightly dif¬ 
ferent because of the repeated occurrence of words peculiar to that in¬ 
dustry. Again, if an equal number of telegrams dealing solely with 
military messages of a tactical character were studied statistically, the 
frequencies would differ slightly from those found above for general gov¬ 
ernmental messages of an administrative character. 

£. If ordinary English literary text (such as may be found in any 
book, newspaper, or printed document) were analyzed, the frequencies of 
certain letters would be changed to an appreciable degree. This is be¬ 
cause in telegraphic text words which are not strictly essential for in¬ 
telligibility (such as the definite and indofinite articles, certain prep¬ 
ositions, conjunctions and pronouns) are omitted. In addition, certain 
essential words, such as "stop”, "period", "comma", and the like, which 
are usually indicated in written or printed matter by symbols not easy 
to transmit telegraphically and which must therefore be spelled out in 
telegrams, occur very frequently. Furthermore, telegraphic text often 
employs longer and more uncommon words than does ordinary newspaper or 
book text. 


d. As a matter of fact, other tables compiled in the Office of the 
Chief Signal Officer gave slightly different results, depending upon the 
source of the text. For example, three tables based upon 75,000, 100,000, 
and 136,257 letters taken from various sources (telegrams, newspapers, 
magazine articles, books of fiction) gave as the relative order of fre¬ 
quency for the first 10 letters the following. 

For 75,000 letters s ETRNIOASDL 
For 100,000 letters : ETRINOASDL 
For 136,257 letters . ETRHAOISLD 

e. Frequency data applicable purely to printed military text were 
compiled by Hifct^-, from a study of 10,000 letters taken from orders and 
reports. The frequencies found by him are given in Tables 4 and 5. 

11. Constancy of the standard or normal, monolitoral-froquency 
distribution. - a. The relative frequencies disclosed by the statistical 
study of large volumes of text may be considered to be tho standard or 
normal frequencies of tho lottors of written English. Counts made of 
smaller volumes of text will tend to approximate these normal froquoncies, 


-*■ Loc. cit., pp. 6-7. 
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TABLE 4 

Froquoncy Table for 10,000 lrtt-rs of literary English, 
as compiled by Hitt. 


Alphabetically arrangod . 


A - 778 

G - 

174 

L - 372 

0 - 8 

V - 112 

B - 141 

H - 

595 

M - 288 

R - 651 

W - 176 

C - 296 

I - 

667 

N - 686 

S - 622 

X - 

27 

D - 402 

J - 

51 

0 - 807 

T - 855 

Y - 196 

E -1277 

F - 197 

K - 

74 

P - 223 

U - 308 

Z - 

17 


Arranged according to relative froquoncy . 


E -1277 

R - 651 

U - 308 

Y - 196 

K - 

74 

T - 855 

S - 622 

C - 296 

W - 176 

J - 

51 

0 - 807 

H - 595 

M - 288 

G - 174 

X - 

27 

A - 778 

£> - 402 

P - 223 

B - 141 

z - 

17 

iJ - 686 

L - 372 

F - 197 , 

V - 112 

Q - 

8 


I - 667 


Hitt also compilod data for telegraphic text (but does not stato what 
kind of massages) and givos the following table: 

TABLE 5 


Frequency Table for 10,000 letters of telegraphic English, 
as compiled by Hitt. 

Alphabetically arrangod . 


A - 813 

G - 201 

L - 

392 

Q - 38 

V 

- 

136 

B - 149 

H - 386 

J - 

273 

R - 677 

J 

- 

166 

C - 306 

I - 711 

N - 

718 

S - 656 

X 

- 

51 

D - 417 

J - 42 

0 - 

844 

T - 634 

Y 

- 

208 

E -1319 

F - 205 

K - 88 

P - 

243 

U - 321 

Z 

“ 

6 


Arranged according 

to relative 

freauencv. 




E -1319 

S - 656 

U - 

321 

F - 205 

K 


88 

0 - 844 

T - 634 

C - 

306 

G - 201 

X 

- 

51 

A - 813 

0 - 417 

il - 

273 

W - 166 

J 

- 

42 

H - 718 

L - 392 

P - 

243 

3 - 149 

Q 

- 

38 

I - 711 

R - 677 

H - 386 

Y - 

208 

V - 136 

z 

- 

6 
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and, within certain limits,the smaller the volume, the lower will be 
the decree of approximation to tho normal, until, in the case of a very 
short message, the normal proportions may not obtain at all. It is ad¬ 
visable that the student fix this fact firmly in mind, for the sooner he 
realizes the true nature of any data relative to the frequency of oc¬ 
currence of letters in text, the less often will his labors toward the 
solution of specific ciphers be thwarted and retarded by too strict an 
adherence to these generalized principles of frequency. He should con¬ 
stantly bear in mind that such data are merely statistical generalizations, 
that they will be found to hold strictly true only in large volumes of 
text, and that they may not even be approximated in short messages. 

b. Nevertheless the normal frequency standard or the "normal ex¬ 
pectancy" for any alphabetic language is, in the last analysis, the best 
guide to, and the usual basis for, the solution of cryptograms of a cer¬ 
tain type. It is useful, therefore, to reduce the normal, monoliteral 
frequency, bar-distribution to a basis that more or less closely approx¬ 
imates the volume of text which the cryptanalyst most often encounters in 
individual cryptograms. As regards length of messages, counting only the 
letters in the body, and excluding address and signature, a study of the 
260 telegrams referred to in paragraph 9 shows that the arithmetical aver¬ 
age is 217 letters; the statistical mean, or weighted average 2 , however, 
is 191 lottors. These two results are, however, close enough together to 
warrant the statement that the average length of telegrams is approximately 
200 letters. The frequencies given in Par. 9 f have therefore been re¬ 
duced to a basis of 200 letters, and the following monoliteral-frequency 
distribution may be taken as showing the most typical distribution to be 
expected in 200 letters of telegraphic English text; 

^ ^ ^ ^ = ^ 

A3CDEFGHIJKLMN0PQRSTUVWXYZ 

T ____ 

x It is useless to go beyond a certain limit in establishing the normal-fre¬ 
quency distribution for a given language. As a striking instance of this 
fact, witness the frequency study made by an indefatigable German, Kaeding, 
who in 1898 made a count of the letters in about 11,000,000 words, totalling 
about 62,000,000 letters in German text. %en reduced to a percentage basis, 
and when the relative order of frequency was determined, the results he ob¬ 
tained differed very little from the results obtained by Kasiski, a German 
cryptographer, from a count of only 1060 letters. See Kaeding, " Haeufig- 
keitswoerterbuch ". Steglitz, 1898; Kasiski, " Die Geheimschriften und die 
Dechiffrir-Kunst ". Berlin, 1863. 

^ The arithmetical average is obtained by adding each different length and 
dividing by the number of different-length messages; the mean is obtained by 
multiplying each different length by the number of messages of that length, 
adding all products, and dividing by the total number of messages. 
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c. The student should take careful note of the appearance of the dis¬ 
tribution 1 shown in Fig. 4, for it will be of much assistance to him in the 
early stages of his study. The manner of setting down the tallies should 
be followed by him in making his own distributions, indicating every fifth 
occurrence of a letter by an oblique tally. This procedure almost auto¬ 
matically shows the total number of occurrences for each letter, and yet 
does not destroy the graphical appearance of the distribution, especially 
if care is taken to use approximately the same amount of space for each set 
of five tallies. Gross-section paper is very useful for this purpose. 


SECTION IV 

FUNDAMENTAL USES OF THE MONOLITERAL FREQUENCY DISTRIBUTION 

Paragraph 


The four facts which can be determined from a study of the 

monoliteral-frequency distribution for a cryptogram. . . 12 

Determining the class to which a cipher belongs . 13 

Determining whether a substitution cipher is monoalphabetic 

or polyalphabet ic. 14 

Determining whether the cipher alphabet is a standard, or a 

mixed cipher alphabet. .. 15 

Determining whether the standard cipher alphabet is direct 

or reversed. 16 


12. The four facts which can be determined from a study of the mono¬ 
literal-frequency distribution for a cryptogram. - a. The following four 
facts (to be explained subsequently) can usually be determined from an in¬ 
spection of the monoliteral frequency, bar-distribution for a given cipher 
message of average length, composed of letters: 

(1) VThether the cipher belongs to the substitution 
or the transposition class; 

(2) If to the former, whether it is monoalphabetic 
or polyalphabetic in character; 

1 The use of the terms "distribution" and "frequency distribution", in¬ 
stead of "table" and "frequency table", respectively, is considered ad¬ 
visable from the point of view of consistency with the usual statistical 
nomenclature. Then data are given in tabular form, with frequencies in¬ 
dicated by numbers, then they may properly be said to be set out in the 
fora of a table . '.Then, however, the same data are distributed in a chart 
which partakes of the nature of a graph, with the data indicated by hori¬ 
zontal or vertical linear extensions, or by a curve connecting points 
corresponding to quantities, then it is more proper to call such a graphic 
representation of the data a bar-distribution . 
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(3) If monoalphabetic, whether the cipher alpha¬ 
bet is a standard cipher alphabet or a mixed cipher 
alphabet; 

(4) If standard, whether it is a direct or re¬ 
versed standard cipher alphabet. 

b. For immediate purposes the first two of the foregoipg determina¬ 
tions are quite important and will be discussed in detail in the next two 
subparagraphs, the other two determinations will be touched upon very 
briefly, leaving their detailed discussion for subsequent sections of the 
text. 


13. Determining the class to which a cipher belongs. - a. The de¬ 
termination of the class to which a cipher belongs is usually a relatively 
easy matter because of the fundamental difference in the nature of trans¬ 
position and of substitution as cryptographic processes. In a transposi¬ 
tion ciphor the original letters of the plain text have moroly boon re¬ 
arranged, without any change whatsoever in their identities, that is, in 
the conventional values they have in the normal alphabet. Hence, the 
numbers of vowels (A, E, I, 0, U> Y), high-frequency consonants (D, N, R, 
S, T), medium-frequency consonants (B, C, F, G, H, L, M, P, V, ’.V), and 
low-frequency consonants (J, K, Q, X, Z) are exactly the same in the 
cryptogram as they are in the plain-text message. Therefore, the per¬ 
centages of vowels, high, medium, and low-frequency consonants are the 
same in the transposed text as in the equivalent plain text. In a sub¬ 
stitution cipher, on the'other hand, the identities of the original let¬ 
ters of the plain text have been changed, that is, the conventional values 
they have in the normal alphabet have been altered. Consequently, if a 
count is made of the various letters present in such a cryptogram, it 
will be found that the number of vowels, high, medium, and low-frequency 
consonants will usually be quite different in the cryptogram from what 
they are in the original plain-text message. Therefore, the percentages 
of vowels, high, medium, and low-frequency consonants are usually quite 
different in the substitution text from what thoy are in the equivalent 
plain text. From these considerations it follows that if in a specific 
cryptogram the percentages of vowels, high, medium, and low-frequency 
consonants are approximately the same as would be oxpectsd in normal 
plain toxt, the cryptogram probably belongs to the transposition class; 
if these percentages are quite different from those to be expected in 
normal plain toxt tho cryptogram probably belongs to the substitution 
class, 


b. In the precoding subparagraph tho word "probably" was emphasized 
by underscoring it, for thoro can be no cortainty in every case of this 
determination. Usually those percentages in a transposition ciphor are 
closo to tho normal porcontagos for plain toxt; usually , in a substitu¬ 
tion cipher, they are far different from tho normal percentages for plain 
toxt. But occasionally a cipher message is encountered which is difficult 
to classify with a reasonable degree of cortainty because tho message is 
too short for the gonoral principles of frequency to manifest themselves. 
It is clear that if in actual messages there woro no variation whatevor 
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from the normal vowel and consonant percentages given in Table 3, the de¬ 
termination of the class to which a specific cryptogram belongs would be 
an extremely simple matter. But unfortunately there is always some var¬ 
iation or deviation from the normal. Intuition suggests that as messages 
decrease in length there may be a greater and greater departure from the 
normal proportions of vowels, high, medium and low-frequency consonants, 
until in very short messages the normal proportions may not hold at all. 
Similarly, as messages increase in length there may be a lesser and lesser 
departure from the normal proportions, until in messages totalling a 
thousand or more letters there may be no difference at all between the 
actual and the theoretical proportions. But intuition is not enough, for 
in dealing with specific messages of the length of those commonly en¬ 
countered in practical work the question sometimes arises as to exactly 
how much deviation from the normal proportions may be allowed for in a 
cryptogram that shows a considerable amount of deviation from the normal 
and might still belong to the transposition rather than to the substitu¬ 
tion class; 

c. Statistical studies have been made on this matter and some graphs 
have been constructed thereon. These are shown in Charts 1-4 ,in the form 
of simple curves, the use of which will now be explained. Each chart con¬ 
tains two curves marking the lower and upper limits, respectively, of the 
theoretical amount of deviation (from the normal percentages) of vowels or 
consonants which may be allowable in a cipher believed to belong to the 
transposition class. 

d. In Chart 1, curve marks the lower limit of the theoretical 
amount of deviation from the normal number of vow&ls to be expected in a 
message of given length; curve Vg marks the upper limit of the same thing. 
Thus, for example, in a message of 100 letters in plain English there 
should be between 33 and 47 vowels (AEIOUY). Likewise, in Chart 2 curves 

and H2 mark the lower and upper limits as regards the high-frequency 
consonants. In a message of 100 letters there should be between 28 and 42 
high-frequency consonants (DlJAST). In Chart 3, curves and Jin mark the 
lower and upper limits as regards the medium-frequency consonants. In a 
message of 100 letters there should be between 17 and 31 medium-frequency 
consonants (iCDFGrHLif'V /). Finally, in Chart 4, curves L^ and Lg mark the 
lower and upper limits as regards the low-frequency consonants. In a 
message of 100 letters there should be between 0 and 8 low-frequency con¬ 
sonants (JKOXZ). In using the charts, therefore, one finds the point of 
intersection of the vertical coordinate corresponding to the length of the 
message, with the horizontal coordinate corresponding to (l) the number of 
vowels, (2) the number of high-frequency consonants, (3) the number of 
medium-frequency consonants, and (4) the number of low-frequency conson¬ 
ants actually counted in the message. If all four points of intersection 
fall within the area delimited by the respective curves t then the number 
of vowels, high, medium, and low-frequency consonants corresponds with the 
number theoretically expected in a normal plain-text message of the same 
length} since the message under investigation is not plain text, it follows 
that the cryptogram may certainly be classified as a transposition cipher. 
On the other hand', if one or more of these points of intersection falls 
outside the area delimited by the respective curves, it follows that the 
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crypto ;ratn is probably a substitution cipher. The distance that the point 
of intersection falls outside the area delinited by these curves is a more 
or less rough measure o‘f the inprobability of the cryptogram's being a 
transposition cipher. 

e. Sometimes a cryptogram is encountered which is hard to classify 
with certainty even with the foregoing aids, because it has been con¬ 
sciously prepared with a view to making the classification difficult. 

This can be done either by selecting peculiar words (as in "trick crypto¬ 
grams' 1 ) or by employing a cipher alphabet in which letters of approxi¬ 
mately similar normal frequ e ncies have been interchanged. For example, 

E may be replaced by 0, T by and so on, thus yielding a cryptogram 
giving external indications of being a transposition cipher but which is 
really a substitution cipher. If the cryptogram is not too short, a 
close study will usually disclose what has been done, as well as the 
futility of so simple a subterfuge. 

f. In the majority of cases, in practical work, the determination 
of the class to which a cipher of average length belongs can be made from 
a mere inspection of the message, after the cryptanalyst has acquired a 
familiarity with the normal appearance of transposition and of substitu¬ 
tion ciphers. In the former case, his eyes very speedily note many high- 
frequency letters, such as i , T, H, H, 0, and S, with th9 absence of low- 
.frequency letters, such as J, K, Q, X, and Zj in the latter case, his 
eyes just as quickly note the presence of many low-frequency letters, and 
a corresponding absence of the usual high-frequency letters. 

1 g. Another rather quickly completed test, in the case of the simpler 
varieties of ciphers, is to look for repetitions of groups of letters . As 
will become apparent very soon, recurrences of syllables, entire words and 
short phrases constitute a characteristic of all normal plain text. Since 
a transposition cipher involves a change in the sequence of the letters 
.composing a plain-text message, such recurrences are broken up so that the 
cipher text no longer will shovr repetitions of more or less lengthy se¬ 
quences of letters, hut if a cipher message does show many repetitions 
and these are of several letters in length, say over four or five, the 
conclusion is at once warranted that the cryptogram is most probably a 
substitution and not a transposition cipher, dovever, for the beginner in 
cryptanalysis, it will be advisable to make the monoliteral frequency, 
bar-distribution, -and note the frequencies of the vowels, the high, medium, 
and low-frequency consonants. Then, referring to Charts 1 to 4, he should 
carefully note whether or not the observed frequencies for these categories 
of, letters fall within the limits of the theoretical frequencies for a 
normal plain-text message of the same length, and be guidod accordingly. 

h. It is obvious that the foregoing rule applies only to ciphers com¬ 
posed, wholly of letters. If a message is composed entirely of figures, or 
of arbitrary signs and symbols, or of intermixtures of letters, figures and 
other symbols, it is immediately apparent that the cryptogram is a sub¬ 
stitution cipher 
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i. Finally, it should he mentioned that there are certain kinds of 
cryptograms whose class cannot be determined by the method set forth in 
subparagraphs b, c., d above. These exceptions will be discussed in a sub¬ 
sequent section of this text. 

14. Determining whether a substitution cipher is monoalphabetic or 
polyalphabetic. - a. It will be remembered that a monoalphabetic sub¬ 
stitution cipher is one in which a single cipher alphabet is employed 
throughout the whole message, that is, a given plain-text letter is in¬ 
variably represented throughout the message by one and the same letter in 
the cipher text. On the other hand, a polyalphabetic substitution cipher 
is one in which two or more cipher alphabets are employed within the same 
message; that is, a given plain-text letter may be represented by two or 
more different letters in the cipher text, according to some rule govern¬ 
ing the selection of the equivalent to be used in each case. From this 
it follows that a single cipher letter may represent two or more different 
plain-text letters. 

■b. It is easy to see why and how the appearance of the monoliteral" - 
frequency distribution for a substitution cipher may be used to determine 
-whether the cryptogram is monoalphabetic or polyalphabetic in character. 

The normal distribution presents marked crests and troughs by virtue of 
two circumstances. First, the elementary sounds which the symbols repre¬ 
sent are used with greatly varying frequencies, it being one of the strik¬ 
ing characteristics of every alphabetic language that its elementary sounds 
are used with greatly varying frequencies.* In the second place, except 
for orthographic aberrations peculiar to certain language (conspicuously, 
English and French), each such sound is represented by the same symbol. 

It follows, therefore, that since in a monoalphabetic substitution cipher 
each different plain-text letter ( = elementary sound) is represented by 
one and only one cipher letter ( = elementary symbol), the monoliteral-fre¬ 
quency distribution for such a cipher message must also exhibit the ir¬ 
regular crest and trough appearance of the normal distribution, but with 
only this important modification- the absolute positions of the crests 
and troughs wi ll not be the same as in the normal . That is, the letters 
accompanying the crests and the troughs in the distribution for the crypto¬ 
gram will be different from those accompanying the crests and the troughs 
in the normal distribution. But the marked irregularity of the distribu¬ 
tion, the presence of accentuated crests and troughs, is in itself an in¬ 
dication that each symbol or cipher letter always represents the same 
plain-text letter in that cryptogram. Hence the general rule. A marked 
crest and trough appearance in the raonoliteral-frequency distribution for 
a--given cryptogram indicates that a single cipher alphabet is involved and 
constitutes one of the tests for a monoalphabetic substitution cipher . 

1 Par. 47. 

^ -The student who is interested in this phase of the subject may find the 
following reference of value. Zipf, G. K. "Selected Studies of the Prin¬ 
ciple of Relative Frequency in Language." Cambridge, *..ass., 1932. 
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c. On the other hand, suppose that in a cryptogram each cipher let¬ 
ter represents several different plain-text letters. Some of them are of 
high frequency, others of low frequency. The net result of such a situa¬ 
tion, so far as the monoliteral frequency distribution for the cryptogram 
is concerned, is to prevent the appearance of any marked crests and troughs 
and to tend to reduce the elements of the distribution to a more or less 
common level. This imparts a "flattened out" appearance to the distribu¬ 
tion. For example, in a certain cryptogram of polyalphabetic construc¬ 
tion, K c = 3p, Gp, and Jpj Rc = Ap, Dp, and Bpj X c = Op, Lp, and Fp. The 
frequencies of K c , Rq and X Q will be approximately equal because the sum¬ 
mations of the frequencies of the several plain-text letters each of these 
cipher letters represents at different times will be about equal. If this 
same phenomenon were true of all the letters of the cryptogram, it is clear 
that the frequencies of the 26 letters, when shown by means of the ordin¬ 
ary raonoliteral frequency distribution, would show no striking differences 
and the distribution would have the flat appearance of a typical polyalp’na- 
betic substitution cipher. Hence, the general rule The absence of 
mark ed crests and troughs in the monoliteral-frequency distribution in ¬ 
dicates that two or more cipher alphabets are involved. The flattened-out 
appearance of the distribution constitutes one of the tests for a poly- 
alphabetic substitution cipher. 

d. The foregoing test based upon the appearance of the frequency 
distribution constitutes only one of several means of determining whether 
a substitution cipher is monoalphabetic or polyalphabetic in composition. 

It can be employed in cases yielding frequency distributions from which 
definite conclusions can be drawn with more or less certainty by mere 
ocular examination. In those cases in which the frequency distributions 
contain insufficient data to permit drawing definite conclusions by such 
examination, certain statistical tests can be applied. These will be dis¬ 
cussed in a subsequent text. 

e. At this point, however, one additional test will be given because 
of its simplicity of application. It may be employed in testing messages 
up to 200 letters in length, it being assumed that in messages of greater 
length ocular examination of the frequency distribution offers little or 

‘no difficulty. This test concerns the number of blanks in the frequency 
distribution, that is, the number of letters of the alphabet which are en¬ 
tirely absent from the message. It has been found from statistical studies 
that rather definite "laws" govern the theoretically expected number of 
blanks in normal plain-text messages and in frequency distributions for 
cryptograms of different natures and of various sizes. The results of 
certain of these studies have been embodied in Chart 5. 

f. This chart contains two curves. The one labeled P applies to the 
average number of blanks theoretically expected in frequency distributions 
based upon normal plain-text messages of the indicated lengths. The other 
curve, labeled R, applies to the average number of blanks theoretically ex¬ 
pected in frequency distributions based upon perfectly r andom assortments 
of letters, that is, assortments such as would be found by random selec¬ 
tion of letters out of a hat containing thousands of letters, all of the 
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25 letters of the alphabet being present in equal proportions, each let¬ 
ter being replaced after a record of its selection has been made. Such 
random assortments correspond to polyalphabetic cipher messages in which 
the number of cipher alphabets i3 so large that if uonoliteral-frequency 
distributions are made of the letters, the distributions are practically 
identical with thoso which are obtained by random selections of letters 
out of a hat. 

g. In using this chart, one finds the point of intersection of the 
vertical coordinate corresponding to the length of the message, with the 
horizontal coordinate corresponding to the observed number of blanks in 
the monoliteral-frequency distribution for the message. If this point of 
intersection falls closer to curve P than it does to curve R, the number 
of blanks in the message approximates or corresponds more closely to the 
number theoretically expected in a plain-text message than it does to a 
random cipher-text message of the same length? therefore, this is evidence 
that the cryptogram is monoalphabetic. Conversely, if this point of inter¬ 
section falls closer to curve R than to curve P, the number of blanks in 
the message approximates or corresponds more closely to the number theo¬ 
retically expected in a random text than it does to a plain-text message 

of the same length? therefore, this is evidence that the cryptogram is 
polyalphabetic. 

h. Practical examples of the use of this chart will be given in some 
of the illustrative messages to follow. 

15. Determining whether the cipher alphabet is a standard, or a mixed 
cipher alphabet. - a. Assuming that the monoliteral-frequency distribu¬ 
tion for a given cryptogram has been made, and that it shows clearly, that 
the cryptogram is a substitution cipher and is monoalphabetic in character, 
a consideration of the nature of standard cipher alphabets 1 almost makes 

it obvious how an inspection of the distribution will disclose whether the 
cipher alphabet involved is a standard cipher alphabet or a mixed cipher 
alphabet. If the crests and troughs of the monoliteral-frequency distribu¬ 
tion occupy positions which correspond to the relative positions they oc¬ 
cupy in the normal-frequency distribution, then the cipher alphabet is a 
standard cipher alphabet. If this is not the case, then it is highly prob¬ 
able that the cryptogram has been prepared by the use of a mixed cipher 
alphabet. 

b. A mechanical test may be applied in doubtful cases arising from 
lack of material available for study. Just v/hat this test involves, and 
an illustration of its application will b8 jiven in the next section, using 
specific examples. 

16. Determining whether the standard cipher alphabet is direct or 
reversed. - Assuming that the monoliteral-frequency distribution for a 
given cryptogram shows clearly that a standard cipher alphabet is involved, 
the determination as to whether the alphabet is direct or reversed, can also 

See Phr. 41, Special Text No. 165, Slementary military Cryptography. 
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be made by inspection, since the difference between the two is merely a 
matter of the dire ct ion in which the sequence of crests and troughs pro- 
gressesi to the right, as in normal reading or writing, or the left. 

In a direct cipher alphabet the direction in i,iiich the crests and troughs 
of the monoliteral-frequency distribution should be read is the normal 
direction, from left to right; in a reversed cipher alphabet this direc¬ 
tion is reversed, from right to left. 


SECTION V. 

.tOiJOLIT JRaL SUBSTITUTION WITH ST;i,D.-rU) CIPHER .iLPHaBUTS 


Paragraph 

Principles of solution by construction and analysis of the 

monoliteral-frequency distx-ibution. 17 

Theoretical example of solution . 13 

Practical example of solution by the frequency method ..... 19 

Solution by completing the plain-component sequence . 20 

Special remarks on the method of solution by completing the plain- 

component sequence. 21 

Value of mechanical solution as a short cut. 22 


17. Principles of solution by construction and analysis of the mono¬ 
literal-frequency distribution. - a. Standard cipher alphabets are of two 
sorts, direct and reversed. The analysis of nonoalphabetic cryptograms 
prepared by their use follows almost directly from a consideration of the 
nature of such alphabets. Since the cipher component of a standard cipher 
alphabet consists either of the normal sequence merely displaced 1, 2, 3 
... intervals from the normal point of coincidence, or of the normal se¬ 
quence proceeding in a reversed-normal direction, it is obvious that the 
raonoliteral-frequency distribution for a cryptogram prepared by means of 
such a cipher alphabet employed monoalphabetically will show crests and 
troughs whose relative positions and frequencies will be exactly the same 
as in the monoliteral-frequency distribution for the plain text of that 
cryptogram. The only thing that has happened is that the whole set of 
crests and troughs of the monoliteral-frequency distribution has been dis¬ 
placed to the right or left of the position it occupies in the monoliteral- 
frequency distribution for the plain text; or else the successive elements 
of the whole set progress in the opposite direction. Hence, it follows 
that the correct determination of the plain-text value of the letter mark- 
ing any crest or trough of the nonoliteral-freauency distribution will re¬ 
sult at one stroke in the correct determination of the plain-text values 
of all the remaining 25 letters respectively marking the other crests and 
troughs in that distribution. Thus, having determined the value of a 
single element of the cipher component of the cipher alphabet, the values 
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of all the remaining letters of the cipher component are automatically- 
solved at one stroke. In more simple language, the correct determination 
of the value of a single letter of the cipher text automatically gives the 
values of the other 25 letters of the cipher text. The problem thus re¬ 
solves itself into a matter of selecting that point of attack which will 
most quickly or most easily lead to the determination of the value of one 
cipher letter. The single word identification will hereafter be used for 
the phrase "determination of the value of a cipher letter"; to identify a 
cipher letter is to find its plain-text value. 

b. It is obvious that the easiest point of attack is to assume that 
the letter marking the crest of greatest frequency in the monoliteral- 
frequency distribution for the cryptogram represents E » Proceeding from 
this initial point, the identifications of the remaining cipher letters 
marking the other crests and troughs are tentatively made on the basis 
that the letters of the cipher component proceed in accordance with the 
normal alpiabetic sequence, either direct or reversed. If the actual 
frequency of each letter marking a crest or a trough approximates to a 
fairly close degree the normal theoretical frequency of the assumed plain¬ 
text equivalent, then the initial identification 0 C = 3 p may be assumed to 
be correct and therefore the derived identifications of the other cipher 
letters may be assumed to be correct. If the original starting point for 
assignment of plain-text values is not correct, or if the direction of 
"reading" the successive crests and troughs of the monoliteral-frequency 
distribution is not correct, then the frequencies of the other 25 cipher 
letters ’dll not correspond to or even approximate the normal theoretical 
frequencies of their hypothetical plain-text equivalents on the basis of 
the initial identification. A new initial point, that is, a different 
cipher equivalent must then be selected to represent E p ; or else the direc¬ 
tion of "reading" the crests and troughs must be reversed. This procedure, 
that is, the attempt bo make the actual frequency relations exhibited by 
monoliteral-frequency distribution for a given cryptogram conform to the 
theoretical frequency relations of the normal-frequency distribution in an 
effort to solve the cryptogram, is referred to technically as "fitting the 
actual monoliteral-frequency bar distribution for a cryptogram to the theo¬ 
retical monoidteral-frequency bar distribution for normal plain text", or, 
more briefly, as " fitting the frequency distribution for the cryptogram to 
the normal-frequency distribution ," or, still more briefly, " fitting the 
distribution to the normal ." In statistical work the expression commonly 
employed in connection with this process of fitting an actual distribution 
to a theoretical one is "testing the goodness of fit". The closeness of 
the degree of goodness of fit may be stated in various ways, mathematical 
in character. 

c. In fitting the distribution to the normal, it is necessary to re¬ 
gard the cipher component (that is, the letters a ... Z marking the succes¬ 
sive crests and troughs of the monoliteral-frequency distribution) as par¬ 
taking of the nature of a wheel or sequence closing in unon itself, so that 
no matter with what crest or trough one starts, the spatial and frequency 
relations of the crests and troughs are constant. This manner of regard¬ 
ing the cipher component as being cyclic in nature is valid because it is 
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obvious that the relative positions and f requencies of the crests and 
troughs of any monoliteral-frequency distribution must remain the same 
regardless of what letter is employed as the initial point of the dis¬ 
tribution . Fig. 5 gives a clear picture of what is meant in this con¬ 
nection, as applied to the normal-frequency distribution. 



Fig. 5 

d, In the third sentence of subparagraph b, the phrase "assumed to 
be correct" was advisedly employed in describing the results of the at¬ 
tempt to fit the distribution to the normal, because the final test of 
the goodness of fit in this connection (that is, of the correctness of 
the assignment 'of values to the crests and troughs of the monoliteral- 
frequency distribution) is whether the consisten t substitution of the 
plain-text values of the cipher characters in the cryptogram will yield 
intelligible plain text. If this is not the case, then no matter how 
close the approximation between actual and theoretical frequencies is, no 
matter how well the monoliteral-frequency distribution fits the normal, 
the only possible inferences are that (l) either the closeness of the fit 
is a pure coincidence in this case, and that another equally good fit may 
be obtained from the same data, or else (2) the cryptogram involves some¬ 
thing '.lore than simple raonoalphabetic substitution by means of a single 
standard cipher alphabet. For example, suppose a transposition has been 
applied in addition to the substitution. Then, although an excellent 
correspondence between the monoliteral-frequency distribution and the 
normal-frequency distribution has been obtained, the substitution of the 
cipher letters by their assumed equivalents will still not yield plain 
text, .lowever, aside from such cases of double encipher.lent, instances 
in which the monoliteral-frequency distribution may be easily fitted to 
the normal-frequency distribution and in which at the same time an attempted 
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simple substitution fails to yield intelligible text are rare. It may be 
said that, in practical operations whenever the monoliteral-frequency 
distribution can be male to fit the normal-frequency distribution, sub¬ 
stitution of values ’-/ill result in solution; and, as a corollary, whenever 
the monoliteral-frequency distribution cannot be made to fit the normal* 
frequency distribution, the cryptogram does not represent a case of simple, 
raonoalphabetic substitution by means of a standa'd alphabet. 

13. Theoretical example of solution. - a. The foregoing principles 
will become clearer by noting the cryptographing and solution of a theo¬ 
retical example. The following message is to be cryptographed. 

' HOSTILE FORCE’ESTE ATED .,T ONE AEGIaSHT INFANTRY 

AND T - r 0 PLATOONS CAVaLRY :0VING bOUTH 01 T -jUIiiNL ONT 
PIK3 STOP iEA D OF C0LU..-1 HEALING ROAJ JUNCTION SEVEN 
THREE SEVEN CttuiA EAST OF GRiSSNACRE SCHOOL FI A JO UPON 
JY OUR rAf AOLS STOP II*VE JHoTROfED 3AIJGE OVER INJI.lN 


CIL 


EK 


b. First, solely for purposes of demonstrating certain principles, 
the nonoliteral-frequency distribution for this message is presented in 
Fig. 6. 
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A3-CDEFGH IJKLi.NOF'jRS 
Fig. 6 


U V 7 X 


Y Z 


£. Now let the foregoing message be cryptographed inonoalphabetically 
by the following cipher alphabet, yielding the cryptogram and the mono- 
literal-frequency distribution shown below. 


Plain. ASCDEFGHIJiZL. il 0 r ] R o T U V / X Y Z 
Cipher. GHIJKL.. HOP | 3 b T U V / X I Z A 3 C D 3 F 
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Fig. 7 - 

d. Let the student now compare Figs. 6 and 7, which have been super¬ 
imposed in Fig. 8 for convenience in examination. Crests and troughs ar9 
present in both distributions; moreover, their relative positions and fre¬ 
quencies have not been changed in the slightest particular. Only the absolute 
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position *f the sequence as a whole has been displaced six intervals t» 
the right in Fig. 7, as compared with the absolute position »f tho sequence 
in Fig. 6. 


as- =M 



ABCDEFGHIJKLMN OPQRSTUVWXY.Z 



ABCDEFGHIJKLMN OPQRSTUVWXYZ 
Fig. 8 

e. If the two distributions are compared in detail the student will 
clearly understand how easy the solution of the cryptogram would be to one 
who knew nothing about how it was prepared. For example, the frequency »f 
the highest crest, representing E in Fig. 6 is 28; at an interval of three 
letters before E there is another crest representing A with frequency 16. 
Between A and E there is a trough, representing the low-frequency letters 
B, G, D. On the other side of E, at an interval of three letters, <jomes 
another crest, representing I with frequency 14. Between E.and I there is 
another trough, representing the low-frequency letters F, G, H. Compare 
these crests and. troughs with their homologous crests and troughs in Fig. 

7. In the latter, the letter K marks the highest crest in the raonoliteral- 
frequency distribution with a frequency of 28; three letters before K there 
is another crefct, frequency 16, and three letters on the other side of K 
there is another crest, frequency 14. Troughs corresponding to B, C,. D and 
F, G, H are seen at^H, I, J and L, M, N in Fig. 7. In fact, the two dis¬ 
tributions may be made to coincide exactly, by shifting the monoliteral- 
frequency distribution for the cryptogram six intervals' to the left with 
respect to the monoliteral-frequency distribution for the equivalent plain¬ 
text message, as shown herewith. 
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ABGDJFGHIJKLMN OPQRSTUVWXYZ 





GHIJKLMNOPQRSTUVWXYZABCDEF 
Fig. 9 


f. Let us suppose new that nothing is knewn about the cryptographing 
process, and that only the cryptogram and its monoliteral-frequency distribu¬ 
tion is at hand. It is clear that simply bearing in mind the spatial relations 
of the crests and troughs in a normal-frequency distribution would enable the 
cryptanalyst to fit the moncliteral-frequency distribution to the normal- 
frequency distribution in this case. He would naturally first assume that 

G c - A , from which it would follow that if a direct standard alphabet is 
involved, H q - B , I c - G_, and so on, yielding the following (tentative) 
deciphering alphabet : 

Cipher: ABCDEFGHIJKLMN OPQRSTUVWXYZ 
Plain s UVIXYZABGDEFGHIJKLMNOFQRST 

g. Now comes the final test: If these assumed values are substituted 
in the cipher text, the plain text immediately appears. Thus: 

NUYZO RKLUX IKKYZ OSGZK JOZUT etc. 

HOSTI LEFOR GEEST IMATE DATON etc. 

h. It should be clear, therefore, that the selection of G c to repre¬ 
sent A in the cryptographing process has absolutely no effect upon the 
relative spatial and frequency relations of the crests and troughs of the 
monoliteral-frequency distribution for the cryptogram. If Q had been selected 
to represent A , these relations would still remain the same, the whole series 
of crests and troughs being merely displaced further to the right of the 
positions they occupy when G Q - A . 
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19. Practical example of solution by the frequency method. - a. The 
cas e of direct stan dard aJLjghabejt ciph ers. 

(1) The folio dig cryptogram is to be solved by ap¬ 
plying the foregoing principles 

IBM j u P B I IJ 0 M B B 8 A JCZOF .. U U '< B A J. C ,.Z 0 

Z 17 I .L N . T T M L 3 ] B P 8 I Z K P T 7 0 7 J I V 3 2 G 

(2) From the presence of repetitions ana so many 

low-frequency letters such as 3, 3, and Z it is at once 

suspected that this is a substitution cipher. ->ut to 
illust-ate %he stsps that must be taken in difficult 
cases in ordir to be certain in this respect, a uono- 
literal-frequency distribution is constructed, and then 
reference is mnae to charts 1 to 4 to note here the 
actual numbers of vowels, high, medium, and lov-fre- 
ouency letters fall insiue or outside the a eas de¬ 
limited by the respective curves. 


= ^ = _ — zr m . =: m 

;,*ai 

— ■=. _ 

.2 

3 C D 3 F 3 H I J i: L i. 

NOP J 

ft S T U V 1 X Y 

Z 

Fig. 10 

a. 

Frequency 

rosition with respect to 
areas dali it 3 d by curves 

.'0 ;sls .1 5 I 0 u : 

17 

outsiue, Cha-t 1 


.ligh-frequency Connoiants (u N R o T) 

4 

outsiae, Cha -t 2 


I-ediu 1 -freouenc j Consonants 
'OB C F j I u id r 7 1 1) 

2 b 

outsine, Chart 3 


Low-frequency Consonants (J .. Q X 2) 

• 14 

outside, Chart 4 


Total 

oO 




(3) /ill four points falling quite outside the 
areas delimited by the curves applicable to these four 
classes of letters, the crvptogra . is clearly a sub¬ 
stitution ci.'ier. 

(4; The appearance of the frequency distribution, 
with .jarred crests and troughs, indicates that the 
cr/ptogran is probably lonoali-habetic. -reference is 
now .iade to Chart 5. The message has 60‘letters and 
6 blanks. Tie point of intersection on the chart is 
closer to curve P than it is to curve C, therefore, 
this is additi-inal evidence that the message is prob¬ 
ably monoalphabetic. 

‘ (5) The 'riext step is to determine whether a stand¬ 

ard or a mixed cipher alphabet is involved. This is- 
done by studving the sequence of crests and trou;hs in 
the mo.ioliteral-frequenc j dist ibution, and t ring to 
fit the distribution to the normal. 
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(6) The fi”*st assumption to be made is that a 
direct standard is involved. The highest crest in 
the aistrioution is marked by B c . Let it be assumed 
that B c = 3 pi Then C c , j) c , 3 C , ... = F ? , G p , H p , thus. 

— 

Cipher ,i B C J ul F S .1 I J K LU N U PQRSTUVtfXYZ 
Plain . D 3FGHIJKLLHOP'JRSTUVT7XYZABC 
Fig. 10 la. 

at first glance the approximation to the expected fre¬ 
quencies seems fair, especially in the region F 0 H I 
J K and 1 S T p . 3ut there are too many occurrences 
of Lp, Pp, X p and C p and too few occurrences of A p , X p , 
lip, Op. iioreovsr, if a substitution is attempted on 
this basis, the following is obtained for the first two 
cipher groups 

Cipher . I B Li Q o PBIUO 

"plain text" .L3PTR S3LXR 

This is certainly not plain text and it seems clear 
that 3 C is not E p . A different assumption will have 
to be made. 

(7) Suppose Q c = 3p. Going through the same 
steps as before, again no satisfactory results are 
obtained. Further trials-'- are made along the same 
lines, until the assumption id c = is tested. 

_ . .**_*! = £*§ # 

Cipher, a B C J 3 F G H I J IC L U 11 0 P Q R S T U V / X Y Z 
ilain ■ BTUVffZYZrtBCJSfGHl JKLLNOPQR 
fig. 10 c. 

(8) The fit in this case is quite good; possibly 
there are too many occurrences of G and II and two 
few of 3p, 0^ and S p . But the final test remains, 
trial of the substitution alphabet on the cryptogram 
itself. This is immediately done and the results are 
as follows. 


It is unnecessary, of course, to write out the alphabets as shown in 
Figs. 10 b and c when testing assumptions. This is usually all done 
mentally. 
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(9) It is always advisable to note the specific 
key. In this case the correspondence between any plain¬ 
text letter and its cipher equivalent will indicate the 
keyj it is usual, however, to indicate the key by not¬ 
ing the cipher equivalent of A p . In this case A p * I c . 

b. The case of reversed standard alphabet ciphers. - 


(1) Lot the following cryptogram and its mono¬ 
literal-frequency distribution be studied- 

I P 2 A C 3 P I W C jSPFKQ HORCL D’WWAP 1H 0 RC 

RUIFJ AXXEF M A P B ¥ IRGBA VCAVD IVPRK 

(2) The preliminary steps illustrated above, under 
subpar. a (l) to (4) inclusive, in connection with the 
test for class and monoalphabeticity, will here be 
omitted, since they are exactly the same in nature. 

The result is that the cryptogram is obviously a sub¬ 
stitution cipher and is monoalphabetic. 

(3‘) assuming tha't it is ’hot known whether a direct 
or a reversed standard alphabet is involved, attempts 
are at once made to fit the raonoliteral-frequency dis¬ 
tribution to the direct sequence. If the student will 
try them he will soon find out that these are unsuc¬ 
cessful. Jill this takes but a few minutes. 

(4) The next logical assumption is now made, viz., 
that the cipher alphabet is a reversed standard alpha¬ 
bet. ftien on this basis 3 C is assumed to be E p , the 
distribution can readily be fitted to the normal, 
practically every crest and trough in the actual dis¬ 
tribution corresponding to a crest or trough in the ex¬ 
pected distribution. 

f 2 Iss: . s S jft — — ~ 2= ' 2 £ ? £ 5 — 

Cipher- ABGD2FGHIJKL llOP'JRSTUVi/XYZ 
Plain - IHGF3DCBAZYX l V U T S A 0 p 0 II .. L II J 
Fig. 10 d 
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(5) '.Vhen the substitution is made in the crypto¬ 
gram, the following is obtained. 

Cryptogram. I P E A C BP Il/C E P P K Q ... 
Plain text i AT JIS HTAilG 3TTYS ... 

(6) The plain-text message is identical with that 
under paragraph a. The specific key in this case is 
also Ap = I c . If the student will compare the frequency 
distributions in the two cases, he will note that the 
relative positions and extensions of the crests and 
troughs are identical; they merely progress in opposite 
directions. 

20. Solution by completing the plain-component sequence. - 
The case of direct _standard a lpha bet ciph er s. - 

(1) The foregoing method of analysis, involving 
as it does the construction of a monoliteral-frequency 
distribution, \ as termed a solution by t he frequenc y 
method because it involves the construction of a fre¬ 
quency distribution and its studv. There is, however, 
another method which Is much more rapid, almost wholly 
mechanical, and which, moreover, does not necessitate 
the construction or study of any freouency dist-ibu- 
tion whatever. An understanding of the method follows 
from a consideration of the method of encipherment of 

a message by the use of a single, direct standard cipher 
alphabet. 

(2) Note the following encipherment. 

1-essage". REPEL INVADING CaVALRY 

Enciphering Alphabet 

Plain . « B C D B F G H I J K L U N 0 P ? R S T U V j X Y 
Cipher. GHIJKLMNOPQRSTUVWXYZABGDS 

Encipherment 

Plain text. REPEL INVAD IN GCA VALRY 

Cryptogram. XKVKR uTBGJ OTMIG BGRXE 

Cryptogram 

X u V I! R OTBGJ 0 T ■: I G BGRXE 

(3) The enciphering alphabet sho'/n above represents a 
case wherein the sequence of letters of both components 

of the cipher alphabet is the normal sequence, with the 
sequence forming the cipher component merely shifted six 
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intervals in retard (or 20 intervals in advance) of the 
position it occupies in the normal alphabet. If, there¬ 
fore, t'vo strips of paper bearing the letters of the 
normal sequence equally spaced are regarded as the two 
components of the cipher alphabet and are. juxtaposed 
at all of the 2b possible points of coincidence that 
/Leld direct standard cipher alphabets, it is obvious 
that one of these 25 juxtapositions must correspond to 
the actual juxtaposition 3hown in the enciphering alpha¬ 
bet directly above. 1 It is equally obvious that if a 
record were kept of the results obtained by applying 
the values given at each juxtaposition to the letters 
of the cryptogram, one of these result3 would yield 
th9 plain text of the cryptogram. 

(4) Let the work be systematized and the results 
set down in an orderly manner for examination. It is 
obviously unnecessary to juxtapose the two components 
so that A c = Ap, for on the assumption of a direct 
standard alphabet, juxtaposing two direct normal com¬ 
ponents at their normal point of coincidence merely 
yields plain text. The next possible juxtaposition, 
therefore, is A a 3_. Let the juxtaposition of the 
two sliding strips therefore be A c = 3 p , as shown heres 

rlain . ABCD3FGIIIJKLuHOP^lSTUVWXfZABCOBFGHIJIOi INOP'-.'RSTUWXfZ 
Ci phe r» ABCDjiFGHIJlILk 1I0PQR3TUV «f2 

The values given by this juxtaposition are substituted 
for the first 15 letters of the cryptogram and the 
following results are obtained* 

_ Cryptogr am* X K V K R 0TB tt J_0 T * I G _B jG R X B 

1st Test - "Plain text" - Y L V L S P U G H K~ P if II J H C H _ S Y F 

This certainly is not intelligible text; obviously, 
the two components were not in the position indicated 
in this first test. The cipher component is therefore 
slid one int3rval to the riyvt, making A c = Cp, and a 
second test is made. Thus. 

Plain . ABCDBFGiiIJKLi.NuP^RSTUWXYZi 1 BC£)BFGrfIJKIi..N01' jRSTUV XYZ 
Cipher AJCjJF.+ IILL. ‘.I0r 11STUV/X£Z 

_ Cryptogram. _- _X _K_V R __0 T 3 G J _0 T In I G 3 G R X 3 

2d Test - "rlain text" - Z li x"li“f r 1 V D I L r \ v"o IT ZG 

Neither does the second test result in disclosing any 
plain text. But, if the results pf the two tests are 
studied a phenomenon that at first seems quite pu*il- 
ing comes to li';ht. Thus, suppose the results of the 

_ two tests are superimpos ed in this fashion. _ 

1 One of the st dps shoulu bea ’ the sequence repeated. This allo\ s for 
juxtaposing the two ssquences at any of the 25 possible points of coinci¬ 
dence so as to have a complete cipher alphabet sho'in" at all times. 
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_C rvnto Tr am- _^_X _K V K R_0 T B G J 0 TJ I I G B SRU 

1st Test - "Plain text" -YLVLSPUCHK PUHJH CHSYF 
2nd Test - "Jrlain text" - Z M X M T QVDIL QVOKI DITZS 

(5) Note what has happened. The net result of the two 
experiments was merely to continue the normal sequence begun 
by the cipher letters at the heads of the several columns. 

It is obvious that if. the normal sequence is completed in 
each column the result s will be exactly the same as though 
the whole set of 25 possib le te sts _h_ad_ actually been per¬ 
formed . Let the columns therefore be completed, as shown 

ixi Fig. 11. 

XKYKROT BflJOTiilGBfl H X E 
YLWLS PUGHKPUNJHCHS Y F 
ZMXLTCjVDILQVOKIDITZG 
ANYHURUS JHRIJPLJ3JUAH 
3 0 Z 0 V 8 X F i; H S X Q ii K F K V B I 
CPAPUTYGLOTYRNLGLVCJ 
J) 5 B 5 X U Z H 1, P U Z S 0 li H i; X D K 
SRCRYVaIN IVATPillNYiSL 
FSDdZi/B JORUBOQO JOZFIi 
G T 3 T A X C K P S X C V R P K P A G N 
HUFU3YJLQTYDWSQLQBH0 
I7G VCZSHiUZiXTHURCIP 
Ji/HUDAFHSVAFYUSNSDJQ 
KXIXSBGOTl/BGZVTOTSKR 
LYJYFGHPUXGHATSril PUFLS 
LZKZGDIQVYDIBXV 2 V G lA T 
NALAHEJRVZ2SJ cywrwhnu 
-OBMBIFKSXAFKDZXSXIOV 
.PGHCJGL.TYBGL3AYTYJPiJ 
n , D 0 D K H i, B Z C H k F B Z U Z K 1 X 

* R 3 P 3 L I H V A J I H G C A T A L R Y 

& F q F ii J 0 './ B B J 0 H D B ; B M S Z 

TGRGJKPXCFKPI3CXCNTA 
D H S H 0 L q Y D G L q J F D Y J 0 U B 

V I T I P ji R Z B H M R K G J3 Z B P V C 

W J U J i N S A F I N S L H F A F Q 71 D 

Fig. 11 

An examination of the successive horizontal lines of the 
diagram discloses one and only one line of plain text, 
that marked by the asterisk and reading RS P j L I H 
V A D I U G C A V A L R Y. 

(6) Since each column in Fig. 11 is nothing but a 
normal sequence, it is obvious that instead of labori¬ 
ously writing down these columns of letters every time 

a cryptogram is to be examined, it would be more conven¬ 
ient to prepare a sbt of strips each bearing the normal 
sequence doubled (to permit complete coincidence for an 
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entire alphabet at any setting, and have them available 
for exa-lining any future cryptograms. In using such a 
set of sliding strips in order to solve a cryi'togram. 
prepared by means of a single direct stundurd cipher 
alphabet, or to make a test t'o date mine -hether a 
cryptogram has bean so prepared, it is only necessary 
to "set up" the letters of the cryptogram on the strips, 
that is, align them in a single row across the strips 
(by sliding the individual strips up or down). The 
successive horizontal lines, called gen era tric es 
(singular generatri x). are then examined in a search 
for intelligible text. If the cryptogram really belongs 
to this simple type of cipher, one of the generatrices 
will exhibit intelligible text - 'all the way across; 
this text will practically invariably be the plain text 
of the message. This method of analysis may be termed 
a so lut ion by comple ti ng t he plain-component sequenc e. 
Sometimes it is referred to as "running down" the se¬ 
quence. The principle upon which the method is based 
constitutes one of the cryptanalyst's most valuable 
tools. 1 

b. The case of reversed stand ard alphabets . - 

(1) The lethod described under subpar. a may also 
be applied ip slightly modified form, in the case of a 
cryptogram enciphered by a single reversed standard 
alphabet. The basic principles are identical in the 
two cases. 

(2) To show th^s it is necessary to experiment with 
two sliding components as before, except that in this 
case one of the components must be a reversed normal 
sequence, the other, a direct normal sequence. 

(3) Let the two components be juxtaposed A to A, 
as shown below, and then let the resultant values be 
substituted for‘the letters of the cryptogram. Thus. 

Cryptogram 

rCRCV YTL. CD Y T A 3 G LGVFI 

Plain . ABCDiiiS'GHIJKLl^OP'iASTUWXfZABCJJJGHIJKIijIlKjP^RSTUTv/XYZ 
Cipher: Z YXV/VUT SR^ PQNMLKJIi IGF3 DC BA 


It is recommended that the student prepare a set of 25 strips i-" x 
x 15", made of well-seasoned wood, and glue alphabet strips to the 
wood* The alphabet on each strip should be a double or repeated alpha¬ 
bet with all letters equally spaced, so as to permit of coincidence 
throughout a complete alphabet. 
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_ Cryptogra m _ _jJtFjT ^±33 Y T a j3 G_ L G V 1 I 

1st rest - "i lain te'xt" - L Y JY F~~ C H P U X C H A JO ' P U F L S 

(4) This does not yield intelligible text, and 
therefore the reversed component is slid one space 
forward and a second test is made. Thus. 

Plain . ABCJEFGxilJKU^OPQRSTUV /XYZABCDEFGHI Jf.Li .HQPQRSTUV.tt YZ 
Ci pho r ■- ZYX’ mJTS.l }P011i JL.KJIHGFEDCBA 

_Cr yptogram _ - PCRC V Y T L 5 3 _Y T A E G L G V P I 

2d Test - "i-lain text" - fa Z K Z G D I 3 V Y JIBXV IV GMT 

(5) Neither does the second test yield intelligible text. 
But let the results of the two tests be superimposed. Thus. 

_ Cryptogram _ - PC AC V_ Y T L G D Y T A j G L G V P I 

1st Test - "Plain text"* - L Y~ J Y F C H P U X C H .A 7 U P U F L S 

2nd Test - "Plain text" -f.ZKZG D I '■ V Y DIBXy QVGMT 

(6) It is seen that the letters of the "plain text" 
given by the seco nd trial are merely the continuants of 
the normal sequences initiated by the letters of the 
"plain text" given by the first trial. If these sequences 
are "run down", that is, completed within the columns, the 
results must obviously be the same as though successive 
tests exactly similar to the first two were applied to the 
cryptogram, using one reversed normal and one direct 
normal component. If the cryptogram has really been pre¬ 
pared by m^ans of a single reversed standard alphabet, one 
of the generatrices of the diagram that results from com¬ 
pleting the sequences must yield intelligible text. 

(7) Let the diagram be made, or.better yet, if the 
student has already at hand the set of sliding strips 
referred to in the footnote to page 44, let him "set up" 
the letters given by the first trial. Fig» 12 shows the 
diagram and indicates the plain-text generatrix. 
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Fig. 12 





(3) 

The 

only difference 

in procedure between this 

case and 

the 

preceding one 

(where the cipaer alphabet 

was a direct 

standard alphabet) is 

that the letters of 


the cipher text are first "deciphered" by means of any 
reversed standard alphabet and then the columns are 
"run down" according to the normal ABC...Z sequence. 

For reasons which will become apparent very soon, the 1 
first step in this method is called "converting the 
cipher letters into their plain-component equivalents"; 
the second step is the same as before "completing the 
plain-component sequences". 

21. Special remarks on the method of solution by completing the plain- 
component sequence. - a. The terms employed to designate the steps in the 
solution set forth in Par. 20 b, viz., "converting the cipher letters into 
their plain-component equivalents" and "completing the plain-component se¬ 
quence", accurately describe the process. Their meaning will become more 
clear as the student progresses with the work. It may be said that when¬ 
ever the plain component of a cipher alphabet is a known sequence, the dif¬ 
ficulty and time required to solve any cryptogram involving the use of that 
plain component is practically cut in half. In some cases this knowledge 
f acilitates and in ot her case s is the o nly th in; th at m akes possible the 
solution of a very sho r t cryptogra m that might ot he rwise defy solution . 
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Later on an example will be given to illustrate what is meant in this 
regard. 

b. The student should'take note, however, of two qualifying expres¬ 
sions that were employed in a preceding paragraph to describe the results 
of the application of the method. It was stated that "one of the gener¬ 
atrices will exhibit intelligible text a ll th e way a cr oss 1 ; this text will 
p ractica l ly in variably be the plain text". 'ill there ever be a case in 
which r.ore than one generatrix will yield intelligible text throughout 
its extant i That obviously depends almost entirely on the number of let¬ 
ters that are aligned to form a generatrix. If a generatrix contains but 
a very few letters, only five, for example, it may happen as a result of 
pure chance that there ’ri.ll be two or more generatrices shoring what 
might be "intelligible text", ilote in Fig. 11, for example, that there 
are several cases in which 3-letter and 4-letber English words (/ilJjf, VAIN, 
'JOT, TIP, etc.) appear on generatrices that are not correct, these words 
being formed by pure chance. Jut there is not a single case, in this 
diagram, of a 5-letter or longer word appearing fortuitously, because ob¬ 
viously the long?r the word the-smaller the probability of its appear¬ 
ance purely by chance; and the probability that two generatrices of 15 
letters each will both yield intelligible text along their entire length 
is exceedingly remote, so remote, in fact, that in practical cryptography 
such a case may be considered nonexistent. 

c. The student should observe that in reality there is no difference 
whatsoever in principle between the two methods presented in subpars. a 
and b of Par. 20. In the former the preliminary step of converting the 
cipher letters into their plain-component equivalents is apparently not 
present but in reality it is there. The reason for its apparent absence 

is that in that case the plain component of the cipher alphabet is identical 
in all respects with the cipher component, so that the cipher letters re¬ 
quire no conversion, or, rather, they are identical with the equivalents 
that would result if they were converted on the basis a c = Ap. In fact, 
if the solution process had been arbitrarily initiated by converting the 
cipher letters into their plain-component eciuivalents at the setting A = 
lip, for exaiple., and the cipher component slid one interval to the right 
thereafter, the results of the first and second tests of Far. 20 a would 
be this. 

Cryptogra m. ‘ _ - XKVKROTBGJOTiIIGBGRXS 

1st Test - ,l Flain text" - L I J fFCHPUXOHA / U F U F L -S 
2nd Test - Vlain text" - i. Z I! Z G H I ] V I J) I 3 X V Q V G iiT 

Thus, the foregoing-diagram duplicates in every particular the d-iagram re¬ 
sulting from the first two tests under Far. 20 a- a first line of cipher 
letters, a second line of letters derived from them but showing externally 
no relationship with the first line, and a third line derived immediately 
from the second line by continuing the direct normal sequence. This point 

1 A person with patience and an inclination toward the curiosities of the 
science might construct a text of 15 or more letters which would yield tw# 
"intelligible" texts on the plain-component completion diagram. 
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is brought to attention only for the purpose of showin-; that a single, 
broad principle is the basis of the general method of solution by complet¬ 
ing the plain-component sequence, and once the student has this firmly in 
mind he will have no difficulty whatsoever in realizing when the principle 
is applicable, what a powerful cryptanalytic tool it can be, and what re¬ 
sults he may expect from its application in specific instances. 

d. In the two foregoing examples of the application of the principle, 
the plain component was a normal sequence but it should' be clear to the 
student, if he has grasped what has been said in the preceding subpara¬ 
graph,, that this component may be a mixed sequence which if known (that is, 
if the sequence of letters comprising the sequence is known to the crypt¬ 
analyst) can be handled just as readily as can a plain component,that is a 
normal sequence. 

e. It is entirely immaterial at what points the plain and the cipher 
components are .juxtaposed in the preliminary step of converting the cipher 
letters into their plain-component equivalents. For example, in the case 
qf the reversed alphabet cipher solved in t'a.r. 20 b, the two components 
Were arbitrarily juxtaposed to give the value A = A, but they might have 
been juxtaposed at any of the other 25 possible points of coincidence 
without in any way affecting the final result, viz., the production of one 
plain-text generatrix in the completion diagram. 

22. Value of mechanical solution as a short cut. - a. It is obvious 
that the very first step the student should take in his attempts to solve 
an unknown cryptogram that is obviously a substitution cipher is to try 
the mechanical method of solution by completing the plain-component se¬ 
quence, using the normal alphabet, first direct, then reversed. This takes 
only a very fsw minutes and is conclusive in £.ts results. It saves the 
labor and trouble of constructing a monoliteral-frequency distribution in 
case the cipher is of this simple type. Later on it will be seen how cer¬ 
tain variations of this simple type may also be solved by the application 
of this method. Thus, a very easy short cut to solution is afforded, which 
even the experienced cryptanalyst never overlooks in his first attack on an 
unknown cipher. 

b. It is important nov/ to note that if neither o f th e two foregoing 
attempts is successful in bringing plain t ext to li ;ht and the cryptogram 
is quite obviously mon oalphabetic i_n_ character, the crypt ana lyst is war ¬ 
ranted in assuming that thecryptogram involves a mixed ci pher alphabet7 1 
The steps to be taken in attacking a cipher of the latter type will be 
discussed in the next" section. 


There is but one other possibility, already referred to under i-ar. 17 d 
which involves the case where transposition and monoalphabetic substitu¬ 
tion processes have been applied in successive steps. This is unusual, 
however, and will be discussed in its proper place. 
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S3CTI0W VI 


Mu^OLITB.IlL bUJSTITUflUU >TTH MI/Wi) CIPHER ALFHABBTS 

Paragraph 

Basic reason for the low degree of cryptographic security 

afforded by monoalphabetic cryptograms involving standard 
cipher alphabets.... • . 23 


Preliminary steps in the analysis of a monoalphabetic, mixed- 

alphabet cryptogram. ........... . ...» 24 

Further data concerning normal plain text ... 25 

Preparation of the work sheet . * . ........... 26 

Triliteral-frequency distributions. 2? 

Classifying the cipher letters into vowels and consonants .... 28 

Further analysis of the letters representing vowels and 

. consonants. 29 

Substituting deduced values in the cryptogram .......... 30 

Completing the solution. 31 

General remarks cm the foregoing solution . ..32 

The "probable-word" method? its value and applicability . 33 

Solution of additional cryptograms produced by the same cipher 


component.j.34 


23. Basic reason for the low degree of cryptographic security afforded 
by monoalphabetic cryptograms involving standard cipher alphabets. - a. The 
student has seen that the solution of monoalphabetic cryptograms involving 
standard cipher alphabets is a very easy matter. Two methods of analysis 
were described, one involving the construction of a frequency distribution, 
the other not requiring this kind of tabulation, being almost mechanical in 
.nature and correspondingly rapid. In the first of these two methods it was 
necessary to make a correct assumption as to the value of .but one of the 26 
letters of the cipher alphabet and the values of the remaining 25 letters 
become at once known; in the second method it was not necessary to assume a 
value for everi a single cipher letter. The student.-should understand what 
constitutes the basis "of this situation*, the fact that the two components 
of the cipher alphabet are composed of known Sequences , ./hat if one or 
both of these components are, for the cryptanalyst, unknown sequences ? In 
other words, what difficulties will confront the cryptanalyst if the cipher 
component of the cipher alphabet is a mixed sequence? ‘./ill such an alpha¬ 
bet be solvable as a whole at one stroke, or will it be necessary to solve 
its values individually? Since the determination of the value of one cipher 
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letter in this case gives no direct clues to the value of any other letter, 
i.t would seem that the solution of such a cipher should involve consider¬ 
ably more analysis and experiment than has the solution of either of the 
two types of ciphers so far examined occasioned. A typical example will 
be studied. 


24. Preliminary steps in the analysis of a monoalphabetic, mixed 
alphabet cryptogram.■- a. Note the following cryptogram: 


3FDZF IOGHL PZFG2 DYSPF H3ZDS GVHTF UPLVD FGYVJ VFVHT GADZZ AIT YD 

ZYI’ZJ ZTGPT VTZ 3D VFHTZ DF XS B GIDZY VTXOI YVT3F VliGZZ THLLV XZDFM 

HTZaI TYDZY 3 DVFH TZDFK ZDZZJ SXISG ZYGAV FSLGZ DTHHT CJZRS VTYZD 

OZFF H TZAIT YJZY G AVJGZ ZTKH I TY ZYb DZGIIU ZFZTG UPGJI X r GHX ASRUZ 

£FUIJ JJGHTV 3AGXX 


b. A casual inspection of the text discloses the presence of several 
long repetitions as well as of many letters of no'rmally low frequency, such 
q.b F, G, V, X, and Z; on the other hand, letters of normally high frequency, 
such as the vowels, and the consonants ii and R, are relatively scarce. The 
cryptogram is obviously a substitution cipher^- and the usual mechanical tests 
for determining whether it is possibly of the monoalphabetic, standard-alpha¬ 
bet type 2 are applied. The results being negative, the monoliteral-frequency 
distribution is immediately constructed and is as shown in Fig. 13. 



s ^ 

A3CD3FGHIJKLLTN0PO 
8 4 1 23 3 19 19 15 10 3 2 5 2 0 3 50 

Fig. 13 



m 

RSTUV7XYZ 
2 10 22 5 16 1 8 14 35 


£. The fact that the monoliteral-frequency distribution shows very 
marked crests and troughs means that the cryptogram is undoubtedly monoalpha¬ 
betic 5 the fact that it has already been tested (by the method of complet¬ 
ing the plain-component sequence) and found not to be of the monoalphabetic, 
standard-alphabet type, indicates with a high degree of probability that it 
involves a mixed cipher alphabet, A few moments might be devoted to making 
a careful inspection of the monoliteral-frequency distribution to insure that 
it cannot be made to fit the normal^ the object of this would be to rule out 
the possibility that the text resulting from substitution by a standard cipher 


Far. 13 f, g, above. 

2 Pars. 20 - 22 , above. 
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alphabet had not subsequently been transposed. But this inspection in this 
case is hardly necessary, in view of the presence of long repetitions in 
the message. (See Par. 13 g.) 

d. One might, of course, attempt to solve the cryptogram by applying 
the simple principles of frequency. One might, in other words, assume that 
Z c (the letter of greatest frequency) represents iS-,; D c (the letter of next 
greatest frequency) represents T and so on. If tne message were long 
enough this simple procedure might more or less quickly give the solution. 
But the message is relatively short and many difficulties would be encoun* 
tered. Much time and effort would be expended unnecessarily, because it is 
hardly to be expected that in a message of only 235 letters the relative 
order of frequency of the various cipher letters should exactly coincide 
with, or even closely approximate the relative order of frequency of let¬ 
ters of normal plain text found in a count of 50,000 letters. It is to be 
emphasized that the beginner must repress the natural tendency to place 
too much confidence in the generalized principles of frequency and to rely 
too much upon them. It is far better to bring into effective use certain 
other data concerning normal plain text which thus far have not been brought 
to notice. 


25. Further data concerning normal plain text. - a. Just as the in¬ 
dividual letters constituting a large volume of plain text have more or 
less characteristic or fixed frequencies, so it is found that digraphs and 
trigraphs have characteristic frequencies, when a large volume of text is 
studied statistically. In Appendix 1, Table 1 are shown the relative fre¬ 
quencies of all digraphs appearing in the 260 telegrams referred to in 
Paragraph 9 e. It will be noted that 546 of the 676 possible pairs of let¬ 
ters occur in these telegrams, but whereas many of them occur but once or 
twice, there are a feiv which occur hundreds of times. 

In Appendix 1 will also be found several other kinds of tables and 
lists which will be useful to the student in his work, such as the relative 
order of frequency of the 50 digraphs of greatest frequency, the relative 
order of frequency of doubled letters, doubled vowels, doubled consonants, 
and so on. It is suggested that the student refer to this appendix now, to 
gain an idea of the data available for his future reference. Just how these 
data may be employed will become apparent very shortly. 

26. Preparation of the work sheet. - a. The details to be considered 
in this paragraph may at first appear to be superfluous but long experience 
has proved that systematization of the work, and preparation of the data in 
the most utilizable, condensed form is most advisable, even if this takes 


1 This possible step is mentioned here for the purpose of making it clear 
that the plain-component sequence completion method cannot solve a case in 
which transposition has followed or preceded monoalphabetic substitution 
with standard alphabets. Gases of this kind will be discussed in a later 
text. It is sufficient to indicate at this point that the frequency dis¬ 
tribution for such a combined substitution-transposition cipher would pre¬ 
sent the characteristics of a standard alphabet cipher - and yet the method 
of completing the plain-component sequence would fail to bring out any 
plain text. 
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some time. In the first place if it merely serves to avoid interruptions 
and 'irritations occasioned by failure to have the data in an instantly 
available form, it will pay by saving mental ./ear and tear. In the second 
place, especially in the case of complicated cryptograms, painstaking care 
in these details, while it nay not always bring about success, is often 
the factor that is of greatest assistance in ultimate solution. The de¬ 
tailed preparation of the data may be irksome to the student, and he may 
be tempted to avoid as much of it as possible, but, unfortunately, in the 
early stages of solving a cryptogram he does not know (nor, for that mat¬ 
ter, does the expert always know) just which data are essential and which 
may be neglected. Even though not all of the data may turn out to have 
been necessary, as a general rule, time is saved in the end if all the 
usual data are prepared as a regular preliminary to the solution of most 
cryptograms. 

b. First, the cryptogram is recopied in the ford of a work sheet. 

This sheet should be of a good quality of paper so as to withstand con¬ 
siderable erasure. If the crypto ;ran is to be copied by hand, cross-sec¬ 
tion paper of squares is extremely useful. The writing should be in 
ink, and plain, carefully made roman capital letters should be used in all 
cases. If the cryptogram is to be copied on a typewriter, the riboon em¬ 
ployed should be impregnated with an inx that will not smear or smudge 
under the hand. 

c. The arrangement of the characters of the cryptogram on the work 
sheet is a matter of considerable importance. If the cryptogram as first 
obtained is in groups of regular length (usually five characters to a 
group) and if the monoliteral-froquency distribution shows the cryptogram 
to be monoalphabetic, the characters should be copied without regard to 
this grouping. It is advisable to allow one space between letters, and 

to write a constant number of letters per line, approximately 25. At least 
two spaces, preferably three spaces should be left between horizontal lines. 
Care should be taken to avoid crowding the letters in any case, for this is 
not only confusing to the eye but also mentally irritating when later it is 
found that not enough space has been left for making various sorts of marks 
or indications. If the cryptogram is originally in what appears to be word 
lengths (and this is the case as a rule only with the cryptograms of ama¬ 
teurs), naturally it should be copied on the work sheet in the original 
groupings. If further study of a cryptogram shows that some special group¬ 
ing is required, it is best to recopv it on a fresh work sheet rather than 
to attempt to indicate the new grouping on the old work sheet. 

d. In order to be able to locate or refer to specific letters or 
groups of letters with speed, certainty, and without possibility of con¬ 
fusion, it is advisable to use coordinates applied to the line3‘ and columns 
of the text as it appears on the work sheet. To miniaize possibility of 
confusion, it is best to appl/ letters to the horizontal lines of the text, 
numbers to the vertical columns. In referring to a letter the horizontal 
line in which the letter is located is usually given first. Thus, referring 
to the work sheet shown below, coordinates A17 designate the letter Y, the 
17th letter in the first line. The letter I is usually omitted from the 
series of line indicators, so as to avoid confusion with the figure 1. If 
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lines are limited to 25 letters each, then each set of 100 letters of the 
text is automatically blocked off by remembering that 4 lines constitute 
100 letters. 

e. Above each character of the cipher text may be some indication 
of the frequency of that character in the whole cryptogram. This indica¬ 
tion may be the actual number of times the character occurs, or, if colored 
pencils are used, the cipher letters may be divided up into three cate¬ 
gories or groups- high frequency, medium frequency, and low frequency. 

It is perhaps simpler, if clerical help is available, to indicate the 
actual frequencies. This saves constant reference to the frequency tables, 
which interrupts the train of thought, and saves considerable time in the 
end. 


f. After the special frequency distribution, explained in P&r. 27 
below, has been constructed, repetitions of digraphs and trigraphs should 
be underscored. In so doing, the student should be particularly watchful 
of trigraph repetitions which can be further extended into tetragraphs 
and polygraphs of greater length. Repetitions of more than six or seven 
characters should be set off by heavy vertical lines, as they indicate 
repeated phrases and are of considerable assistance in solution. Revers¬ 
ible digraphs should also be indicated by an underscore with a loop in 
it. Anything which stri :es the eye as being peculiar, unusual, or sig¬ 
nificant as regards the distribution or recurrence of the characters 
should be noted. All these marks should, if convenient, be made with ink 
so as not to cause smudging. The work sheet will now appear as shown 
herewith. 


1 2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 

21 

22 

23 

24 

25 

10 19 

23 

35 

19 

10 

3 

19 

15 

5 

5 

35 

19 

19 

35 

23 

14 

10 

5 

19 

15 

4 

35 

23 

10 

A. S F 

D 

Z 

F 

I 

0 

G 

H 

L 

P 

Z 

F 

G 

Z 

D 

Y 

S 

P 

F 

H 

B 

Z 

D 

S 

19 16 

15 

22 

19 

5 

5 

5 

16 

23 

19 

19 

14 

16 

3 

16 

19 

16 

15 

22 

19 

8 

23 

35 

35 

3. G V 

H 

T 

F 

U 

P 

L 

V 

D 

F 

G 

Y 

V 

J 

V 

F 

V 

H 

T 

G 

A 

D 

Z 

Z 

8 10 

22 

14 

23 

35 

14 

19 

35 

3 

35 

22 

19 

5 

22 

16 

22 

35 

4 23 

16 

19 

15 

22 

35 

C. A I 

T 

Y 

D 

Z 

Y 

F 

Z 

J 

Z 

T 

G 

P 

T 

V 

T 

Z 

B 

D 

V 

F 

H 

T 

Z 

23 19 

8 

10 

4 

19 

10 

23 

35 

14 

16 

22 

8 

3 

10 

14 

16 

22 

3 

19 

16 

2 

19 

35 

35 

-I 

X 

S 

B 

G 

I 

D 

Z 

Y 

V 

T 

X 

. 0 

I 

Y 

V 

T 

3 

F 

V 

M 

G 

Z 

Z 

22 15 

5 

5 

16 

8 

35 

23 

19 

2 

15 

22 

35 

8 

10 

22 

14 

23 

35 

14 

4 

23 

16 

19 

15 

3. T H 

L 

L 

V 

X 

Z_ 

D 

_F 

xl 

H. 

T 

Z 

A 

I 

JL 

Y 

D 

Z 

Y 

B 

D 

V 

F 

H 

22 35 

23 

19 

2 

35 

23 

35 

35 

3 

10 

8 

10 

10 

19 

35 

14 

19 

8 

16 

19 

10 

5 

19 

35 

F._T_Z_ 

D 

F 

K 

Z 

D 

Z 

Z 

J 

5 

X 

I 

S 

G 

Z 

Y 

G 

A 

V 

F 

S 

L 

G 

Z 

23 22 

15 

15 

22 

1 

23 

35 

2 

10 

16 

22 

14 

35 

23 

3 

35 

19 

19 

15 

22 

35 

8 

10 

22 

G. £> T 

H 

a 

T 

C 

D 

Z 

R 

5 

V 

T 

Y 

Z 

D 

0 

Z 

F 

F 

H. 

T 

Z 

A 

I 

T 

14 23 

35 

14 

19 

8 

16 

23 

19 

35 

35 

22 

2 

15 

10 

22 

14 

35 

14 

10 

23 

35 

19 

15 

5 

H. Y D 

JL 

Y 

G 

A 

V 

p 

G 

Z 

Z 

T 

K 

H 

I_ 

T 

_Y 

Z 

Y 

S 

D 

Z 

G 

H 

U 

35 19 

35 

22 

19 

5 

5 

19 

23 

10 

8 

1 

19 

15 

8 

8 

10 

2 

5 

35 

23 

19 

5 

10 

23 

J. Z F 

Z 

T 

G 

U 

P 

G 

D 

I 

X 

n 

G 

H 

X 

A 

S 

R 

U 

z_ 

D 

_F 

U 

I 

D 

3 19 

15 22 

16 

3 

8 

19 

8 

8 
















K. 3 G 

H 

T 

V 

3 

A 

G 

X 

X 
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27. Triliteral-frequency distributions. - a. In what has gone before, 
a type of frequency distribution known as a monoliteral-frequency, bar-dis¬ 
tribution was used. This, of course, shows only the number of times each 
individual letter occurs. In order to apply the normal digraph and trigraph 
frequency data (given in Appendix 1) to the solution of a cryptogram of the 
type now being studied, it is obvious that the data with respect to digraphs 
and trigraphs, occurring in the cryptogram should be compiled and should be 
compared with the data for normal plain text. In order to accomplish this 
in suitable manner, it is advisable to construct a slightly more complicated 
form of distribution termed a triliteral-frequencv distribution. 

b. Given a cryptogram of 50 or more letters and the task of determin¬ 
ing what trigraphs are present in the cryptogram, there are three ways in 
v'hich the data may be arranged or assembled. One may require that the data 
show. 


(1) Each letter with its two succeeding letters, 

(2) Each letter \dfch its two preceding letters; 

(3) Each letter with one preceding letter and one 
succeeding letter. 

c. a distribution of the first of the three foregoing types nay be 
designated as a "trilitera1-frequency distribution showing two suffixes"; 
the second type may be designated as a "triliteral-frequency distribution 
showing two prefixes"; the third type may be designated as a "triliteral- 
frequency distribution showing one prefix and one suffix." ,>uadriliteral- 
and pentaliteral-frequency distributions may occasionally be found useful. 

d. .;hich of these three arrangements is to be employed at a specific 
time depends largely upon what the data are intended to show. For present 
purposes, in connection with the solution of a raonoalphabetic substitution 
cipher employing a mixed alphabet, possibly the third arrangement, that 
showing one prefix and one suffix, is most satisfactory. 

e. It is convenient to use cross-section paper for the construction 
of a triliteral-frequency distribution in the fora of a bar-distribution 
showing crests and troughs, such as that in Figure 14. In that figure the 
prefix to each letter to be recorded is inserted in the upper half of the 
cell directly opposite the cipher letter bqing recorded; the suffix to each 
letter is inserted in the lower half of the cell directly opposite the let¬ 
ter being recorded; and in each case the prefix and tho suffix to the let¬ 
ter being recorded occupy the same cell, the prefix being directly above the 
suffix. The number in parentheses gives the total frequence for each Tetter. 


Heretofore such a distribution has been termed a "trigraphic-frequency 
table". It is thought that the word "fcriliteral"is more suitable, to cor¬ 
respond with the designation ‘ , monoliteral ,, in the case of the distribution 
of the single letters. The use of the word "distribution" to replace the 
word "table" has already been explained. 
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QOiQlaiSJD TABLE OF 'W_*L21jj.S 
Trigraphs 


DZ-9 

23-5 

VP-4 

D3Y-4 

ZD-9 

TY-5 

VT-4 

H23-4 

H2-8 

Frl-4 

Z7-4 

ITY-4 

ZY-6 

3E-4 

Z2-4 

3D7-4 

DP-5 

GZ-5 

12-4 

3Z-4 

.0.2-3 


FH2-3 

2YD-3 

YDZ-3 

ZAI-3 


Loti :er Polygraphs 

H2ZAI2YDZY-2 
BDVPH1LDP-2 
Z.UWDSr-3 
3EJE-3 


GI 

33 

VG 

YZ 

ZO 

GZ 

zs 

72 

BY 


DU ax 

z 1 : m 

PH '7H 
Z? PD 
',■3 2U 32 
Dir ZH GE 
VII DZ GU * 
DU YA El 


IY 

HZ 

VY 

HO 

DE 

HZ 

IY 





YS 


EV 

LZ 

F2 










AH 

HL 





ZP 


DX 

YA 

HT 

UD 









Zil 





12 


VH 

SZ 

2H' 

DX 









YD 

VE 


SO 



ZP 


Y3 

III 

P2 

HT 









XV 

VX 


ZS 

GV 



3V 

YZ 


W 

D3 

BI 

TE 

Ii2 

TL 

AT 









PL 

13 

VZ 


ZI- 



AZ 


2U 

2A 

P2 

AT 



S3 



UG 



JX 

EV 

PI 

GV 

YD 


VP 


EH 

PY 

V2 

OY 



LV 



3T 



XB 

ZG 

HZ 

ZI 

S3 


ZS 

VA 

Z3 

SV 

V2 

3D 

23 


EL 


DS' 

UL 


SU 

DG 

- IY 

GX 

ZI 

ZD 


ZY 

D3 

ZI 

PZ 

?B 

AT 

ZZ 

TH 

EV 

PH 

XI 

SP 


Yx 

HG 

HZ 

3D 

HZ 

2D 



SD 

OH 

3L 

PO 

W 

PZ 

HE 

VG 

13 

LZ 


zs 

-P 

HP 

PE 

A 

(8) 

B 

(4) 

C 

(1) 

D 

(23) 

E 

(3) 

(19)(19)(15)(10) 

J 

(3) 

K 

(2) 

L 

(5) 

LI i, 0 

(2) (0) (3) 

E 

(5) 

-1 

(0) 

a 

(2) 

S 2 
(10)(22) 

U 

(5) 


UD 

PT 

UP 

DO 

YY- 

22 


OP 

YD 

DH 

GD 

GY 

ZJ 

DZ 

1Z> 

ID 

DY 


ZS 32 
2Z GZ 
ZG DY 
2D 2D 
TZ 2B 
ZG J2 
X- 3B PJ 
GX 2D DY 
HA IV‘ ZA 
3V 


I\l 3V DZ 
SI ZP BD 
VZ 2D 3D 
TO GV EF 
XG FS DS DP 

Vi! X Y Z 


FiU. 14 
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For example, in Fig. 14, note the prefixes and suffixes of the letter D,! 

FZZVAYBZIZYBZZZCZYVSGZI 

2YSFZZVFZFZVFZTZ0ZG2IFE 

f. The triliteral-frequency distribution is now to be examined with 
a view to ascertaining what digraphs and trigraphs occur two or more times 
in the cryptogram. Consider the pair of lines containing the prefixes and 
suffixes to D c in the distribution, as sho\/n directly above. This pair of 
lines shows that the following digraphs appear in the cryptogram: 


Digraphs based on prefixes Digraphs based on suffixes 


FD, 

ZD, 

ZD, 

VD, 

AD, 

YD, 

BD, 

DZ, 

DY, 

DS, 

DF, 

DZ, 

DZ, 

DV 

ZD, 

ID, 

ZD, 

YD, 

BD, 

ZD, 

ZD, 

DF, 

DZ, 

DF, 

DZ, 

DV, 

DF, 

DZ 

ZD, 

CD, 

ZD, 

YD, 

VD, 

SD, 

GD, 

DF, 

DZ, 

DO, 

DZ, 

DG, 

DZ, 

DI 

ZD, 

ID 






DF, 

DE 







The nature of the tabulation in the triliteral-frequency distribution is 
such that in finding what digraphs are present in the cryptogram it is im¬ 
material whether the prefixes or the suffixes to the cipher letters are 
studied, so long as one is consistent in the study . For example, in the 
foregoing list of digraphs based on the prefixes to D c , the digraphs FD, 

ZD, ZD, VD, etc., are found; if now, the student will refer to the suffixes 
of F c , Z Q , V 0 , etc., he will find the very same digraphs indicated. This 
being the case, the question may be raised as to v/hat value there is in 
iisting both the prefixes and the suffixes to the cipher letters. The 
answer is that by so doing the trigraphs are indicated at the same time. 

For example, in the case of D c , the following trigraphs are indicated* 

FDZ, ZDY, ZDS, VDF, ADZ, YDZ, BDV, ZDF, IDZ, ZDF, YDZ, BDV, ZDF, 

ZDZ, ZDT, CDZ, ZDO, YDZ, VDG, SDZ, GDI, ZDF, IDE. 

g. The repeated digraphs and trigraphs can now be found quite readily. 
Thus, in the case of D c , examining the list of digraphs based on suffixes, 
the following repetitions are noted: 

DZ appears 9 times 
DF appears 6 times 
DV appears 2 times 

Examining the trigraphs with D c as central letter, the following repeti¬ 
tions are noted: 


ZDF appears' 4 times 
YDZ appears 3 times 
BDV appears 2 times 

h. It is unnecessary, of course, to go through the detailed procedure 
set forth in the preceding subparagraphs in order to find all the repeated 
digraphs and trigraphs. The repeated trigraphs with D c as central letter 
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can be found merely from an inspection of the prefixes and suffixes op¬ 
posite D c in the distribution. It is necessary only to find those cases 
in which two or more prefixes are identical at the same time that the suf¬ 
fixes are identical. For example, the distribution shows at once that in 
four oases the prefix to D is Z Q at the same time that the suffix to this 
letter is F c . Hence, the trigraph ZDF appears four times. The repeated 
trigraphs may all be found in this manner. 

i. The most frequently repeated digraphs and trigraphs are then as¬ 
sembled in what is termed a c ondensed table of repetitions , so as to bring 
this information prominently before the eye. As a rule, digraphs which 
occur less than four or five times, and trigraphs which occur less than 
three or four times may be omitted from the condensed table as being rela¬ 
tively of no importance in the study of repetitions. In the condensed 
table the frequencies of the individual letters forming the most important 
digraphs, trigraphs, etc., should be indicated. 

28. Classifying the cipher letters into vowels and consonants. - a. 
Before proceeding to a detailed analysis of the repeated digraphs and tri¬ 
graphs, a very important step can be taken which will be of assistance not 
only in the analysis of the repetitions but also in the final solution of 
the cryptogram. This step concerns the classification of the high-fre¬ 
quency letters into two groups; vowels and consonants. For if the crypt¬ 
analyst can quickly ascertain the equivalents of the four vowels, A, Hi, I, 
and 0, and of only the four consonants, II, R, S, and T, he will then have 
the values of approximately two-thirds of all the cipher letters that oc¬ 
cur in the cryptograms the values of the remaining'letters can almost be 
filed in automatically. 

b. The basis for the classification will be found to rest upon a 
comparatively simple phenomenon, the associational or combinatory be¬ 
havior of vowels is, in general, quite different from that of consonants. 

If an examination be made of Table 7Bin Appendix 1, showing the relative 
order of frequency of the 18 digraphs composing 25 per cent of English 
telegraphic text, it will be seen that the letter 3 enters into the composi¬ 
tion of 9 of the 18 digraphs; that is, in exactly half of all the cases the 
letter 3 is one of the two letters forming the digraph. The digraphs con¬ 
taining E are as follows; 

ED EN ER ES 

ME RE SE TE V3 

The remaining nine digraphs are as follows. 

AM HD OR ST 
IN MT TH 

OH TO 

None of the 18 digraphs are combinations of vowels . Note now that of the 
9 combinations with E, 7 are with the consonants N, R, S, and T, one is 
with D, one is with V, and none is with any vowel . In other words, Ep com¬ 
bines most readily with consonants but not with other vowels, or evenwith 
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itself. Using the terms often employed in the chemical analogy, E shows a 
great "affinity" for the consonants ii, H, S, T, but not for the vowels. 
Therefore, if the letters of highest frequency occurring in a given crypto¬ 
gram are listed, together with the number of times each of them combines 
with the cipher equivalent of S , those which show considerable combining 
power or affinity for the cipher equivalent of Ep may be assumed to be the 
cipher equivalents of N, .1, S, T d j those which do not show any affinity for 
the cipher equivalent of E p may be assumed to be the cipher equivalents of 
A, I, 0, Up. Applying these principles to the problem in hand, and examin¬ 
ing the triliteral-frequency distribution, it is quite certain that Z c = E p , 
not only because Z c is the letter of highest frequency, but also because it 
combines with several other high-frequency letters, such as D c , F c , G c , etc. 
The nine letters of next highest frequency ares 




23 

22 19 

19 16 . 15 14 10 

10 




D 

T F 

G V H Y S 

I 


Let the combinations 

these lotters : 

form with Z c be indicated in 

the follow- 

ing manner. 







Ho. of times 







Z c occurs / as 







prefix 

i 






Cipher Letter:- 

D(23) 

,T(22) 

F(19) 

G(19) V(16) H(15) 

Y(14) 

S(10) 1(10) 

llo. of times 

w 


— 

m 

= 



Z c occurs as == 
suffix 

Consider D c . It occurs 23 times in the message and 18 of those times it is 
combined with Z 6 , 9 times in the form Z C D C (» E0 p ), and 9 times in the form 
D C Z C (=0Ep). It is clear that D c must be a consonant. In the same way, 
consider T c , which shows 9 combinations with Z c , 4 in the form Z C T 0 (= E0 p ) 
and 5 in the form T C Z C (» 0E p ). The letter T c appears to represent a con¬ 
sonant, as do also tho lotters F c , G Q , and Y c . On the other hand, consider 
V c , occurring in all 16 times but never in combination with Z c ; it appears 
to represent a vowel, as does also the letters H c , S c , and I Q . So far, 
then, the following classification would seem logical: 

Vo y/el 8 Consonants 

Z 0 (=3 p )> V c- H - S o.Ic V T 0> P c) G c> 

29. Further analysis of the letters representing vowels and consonants, 
a. 0 p is usually the vo’-'el of second highest frequency. Is it possible to 
determine which of the letters V, H, S, I c is the cipher equivalent of 0 p i 
Let reference bo made again to Table H in Appendix 1, where it is seen that 
the 10 most frequently occurring diphthongs are. 

Uiphthong - 10 OU EA El AI IS AU EO AY US 

Frequency - 406 365 345 273 172 131 128 121 120 114 
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If V, ri, S, I c are really the cipher equivalents of A, I, 0, U p (not re,- 
spectively) , perhaps it is possible to determine which is vhich bv examin- 
ing the combinations they ./lake among themselves and with Z c (« E p ) . Let 
the combinations of V, H, S, I, and Z that occur in the message be listed. 
There are only the following; 

ZZ C (= li3_) - 4 HI - 1 

VH 1 - 2 SV - 1 

HH - 1 IS - 1 


Note the doublet HH C $ if H c is a vowel, then the chances are excellent 
that II C = 0 , becauso the doublets aAp, II , UU_, are practically non-ex¬ 
istent, whereas the double vowel combination, 0D p , is of next highest 
frequency to the double vowel combination, EEy. If H c = 0 p , then V c must 
be I p because the digraph VH C occurring two tines in the message could 
hardly be A0 p , or U0 p , whereas the diphthong Iu p is the one of high fre¬ 
quency in English. So far then, the tentative (because so far unverified) 
results of the analysis are as follows. 


Z c = E p H c = 0 p V c * I p 

This leaves only two letters, I c and S c , classified as vowels, to be sep¬ 
arated into A p and U p . Note the digraphs; 


HI 

SV 

IS 


c 

c 

c 


09 


Only two alternatives are open. 


(1) Either I c = A p and S c = U p , 

(2) Or I c - U p and S c = A p . 


If the first alternative is selected, then 


HI C 

sv c 

IS 0 


If the second alternative is selocted, then 


Hie » 0U p 
SV C = AI p 
IS C - UA p 

The eye finds it difficult to choose between these alternatives; but sup¬ 
pose the frequency values of the plain-text diphthongs as given in Table 11 
of Appendix 1 are added for each of these alternatives, giving the follow¬ 
ings 
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HI C a Grip, frequency value = 72 
SV C = UIp, frequency value » 46 
IS C s AUp, frequency value = 128 
Total 246 


HI C = OU , frequency value *355 
SV C = UIp, frequency value =172 


Mathematically, the second alternative is almost twice as probable as the 
first. Let it be assumed to be correct and the following (still tentative) 
values are now at hand: 


^c = H c - Op Vc = Ip S c — Ap I c = Up 

b. Attention is now directed to the letters classified as consonants, 
ilow far is it possible to determine their values? The letter, D c , from 
considerations of frequency alone, would seem to be T , but its frequency, 
23, is not considerably greater than that for T 0 . It P is not much greater 
than that for F c or G c , with a frequency of 19 each. But perhaps it is 
possible to determine not the value of one letter alone but of two letters 
at one stroke. To do this one may make use of a tetragraph of considerable 
importance in English, viz., TIOKg. For if the analysis pertaining to the 
vowels is correct, and if VH C * I0p, then an examination of the letters 
immediately before and after the digraph VH in the cipher text might dis¬ 
close both T p and Up. Reference to the text gives the following: 

GVHT 0 FVHT C 

eio« p eiOGp 

The letter T c follovrs VHc in both cases, indicates very probably that T c * 

Up5 hut as to whether G c or F 0 equals T p cannot be decided. However, two 

conclusions are clear: first, the letter D 0 is neither T p nor Np, from 
which it follows that it must be either Rp or S p ; second, the letters G c 
and F c must be either T p and Sp, respectively, or S and T p respectively, 
because the only tetragraphs usually found (in English) containing the 
diphthong IOp as central letters are SIOUp and TIOIIp. Thi3 in turn means 
that as regards D c , the latter cannot be either R p or S p ; it must be R p , 
a conclusion which is corroborated by the fact that ZD C ( = ER p )and UZ 0 
(= R3p) occur 9 times each. Thus far, then, the identifications, when in¬ 
serted in an enciphering alphabet, are as follows. 

Plain : i B C J) i F OH I J K Lil H 0 P 1 1 8 T U Y i X I i 

Cipher: S Z V TH JGFI 
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30. Substituting deduced values in the cryptogram. - a. Thus far 
the analysis has been almost purely hypothetical, for as yet not a single 
one of the values deduced from the foregoing analysis has been tried out 
in the cryptogram. It is high time that this be done, because the final 
test of the validity of the hypotheses, assumptions and identifications 
made in any cryptographic study is, after all, only this: do these 
hypotheoes, assumptions and identifications ultimately yield verifiable, 
intelligible, plain-text when consistently applied to the cipher text? 

b. At the present stage in the process, since there are at hand 
the assumed values of but 9 out of the 25 letters that appear, it is ob¬ 
vious that a continuous "reading" of the cryptogram can certainly not be 
expected from a mere insertion of the values of the 9 lotters. However, 
the substitution of these values should do two things: first, it should 
immediately disclose the fragments, outlines, or "skeletons" of "good" 
words in the text; and second, it should disclose no places in the text 
where "impossible" sequences of letters are established. By the first is 
meant that the partially deciphered text should show the outlines or 
skeletons of v/ords such as may be expected to be found in the corcmunica- 
tion; this will become quite clear in the next subparagraph. By the 
second is meant that sequences, such as "AOOjSN" or "THRSEiTO" or the like, 
obviously not possible or extremely unusual in normal English text, must 
not result from the substitution of the tentative identifications result¬ 
ing from the analysis. The appearance of several such extremely unusual 
or impossible sequences at once signifies that one or more of the assumed 
values is incorrect. 

c. Here are the results of substituting the nine values which have 
been deduced by the reasoning based on a classification of the high-fre¬ 
quency letters into vowels and consonants and the study of the members of 
the two groups: 
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1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 
10 19 23 35 19 10 3 19 15 5 5 35 19 19 35 23 14 10 5 19 15 4 35 23 10 


A SFjjZFIOGHLPZ 
A T R E T SO 3 

3 S T 

19 16 15 22 19 5 5 5 16 23 19 19 

B GVHTFUPLVDFG 
SIONT IRTS 

T S ST 

8 10 22 14 23 35 14 19 35 3 35 22 

G AITYDZYFZJZT 
N RE T E E N 

S 

23 19 8 10 4 19 10 23 35 14 16 22 

D DFXSBGIDZYVT 
RT A S RE IN 
S T 

22 15 5 5 16 8 35 23 19 2 15 22 

E THLLVXZDFMHT 
NO I E R T OH 

S 

22 35 23 19 2 35 23 35 35 3 10 8 

F TZDFKZDZZJSX 

NERT E R E 3 . A 

S 

23 22 15 15 22 1 23 35 2 10 16 22 

G DTHHTCJZRSVT 

RNOON RE AIN 


14 23 35 14 19 8 16 23 19 35 35 22 

H YDZYGAVDGZZT 
RE S IRSEEH 
T T 

35 19 35 22 19 5 5 19 23 10 8 1 

J ZFZTGUPGDIXlf 
E T E N S SR 

S T T 

3 19 15 22 16 3 8 19 8 8 

K 3GHTV3AGXX 
S 0 N I S 

T T 


FGZDYSPFHBZDS 
TSSR .A TO ERA 

ST S 

14 16 3 16 19 16 15 22 19 8 23 35 35 

YVJVFVHTGaDZZ 
I ITIONS REE 

S T 

19 5 22 16 22 35 4 23 16 19 15 22 35 

GPTVTZB DVFHTZ 
S NINE RITONE 

T S 

8 3 10 14 16 22 3 19 16 2 19 35 35 

XOIYVTEFVHGZZ 
IN T I SEE 
S T 

35 8 10 22 14 23 35 14 4 23 16 19 15 

ZAITYDZYBDVFH 
E H RE' RITO 

S 

10 10 19 35 14 19 8 16 19 10 5 19 35 

ISGZYGAVFSLGZ 
ASE S ITA SE 

T T S T 

14 35 23 3 35 19 19 15 22 35 8 10 22 

YZDOZFFHTZAIT 
ER ETTONE N 

S S 

2 15 10 22 14 35 14 10 23 35 19 15 5 

KHITYZYSDZGHU 
0 N E ARSSO 
T 

24 15 8 8 10 2 5 35 23 19 5 10 23 

GHXASRUZDFUID 
SO A E R T R 

T S 


d. No impossible sequences are b-ought to light, and, moreover, several 
long words, nearly complete, stand out in the text. Note the following por¬ 
tions; 
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, , HBZDSGVHTF 
CD 0 3 1ASI0.IT 

T S 

C15 D2 

TVTZBDVFHT ZDF 
(2) HIKE R I T 0 N i3 R T 

& S 

G5 

G Z D T H H T 

S E R ' N 0 0 N 

T 

The words are obviously OPERATIONS, NINE PRISONERS, and AFTERNOON. The 
value G c is clearly T p ; that of F c is S p $ and the following additional 
values are certain; 


(3) 


B c - P p L c = F P 

31. Completing the solution. - a. Each tine an additional value is 
obtained, substitution is at once made throughout the cryptogram. This 
leads to the determination of further values, in an ever-widening circle, 
until all the identifications are firmly and finally established, and the 
message is completely solved. In this case the decipherment is as follows: 


1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 12 13 

14 

15 

16 

17 

18 

19 

20 

■ 21 

22 

23 

24 25 

s' 

F 

D 

Z 

F 

I 

0 

" G 

h“ 

iT 

P 

Z 

F 

G 

Z 

D 

Y 

S 

P 

F 

H 

B 

Z 

D 

S 

A 

S 

R 

E 

S 

U 

L 

T 

0 

F 

Y 

3 

S 

T 

E 

R 

D 

A 

Y 

S 

0 

P 

E 

R 

A 

G 

V 

H 

T 

F 

U 

P 

L 

V 

D 

F 

G 

Y 

V 

J 

V 

F 

V 

H 

T 

G 

A 

D 

Z 

Z 

T 

I 

0 

N 

S 

3 

Y 

F 

I 

a 

S 

T 

D 

I 

V 

, I 

S 

I 

0 

N 

T 

H 

R 

E 

E 

A 

I 

T 

Y 

J 

Z 

Y 

F 

z 

j 

Z 

T 

G 

P 

T 

V 

T 

Z 

B 

D 

V 

F 

H 

T 

Z 

H 

u 

N 


R 

E 

I) 

S 

E 

V 

E 

N 

T 

Y 

N 

I 

ii 

E 

•P 

• R 

I 

• S 

0 

N 

E 

D 

F 

X 

S 

B 

G 

I 

D 

z 

Y 

V 

T 

X 

0 

I 

Y 

V 

T 

E 

F 

V 

M 

G 

Z 

Z 

R 

S 

C 

A 

P 

T 

U 

R 

3 

3 

I 

N 

G 

L 

U 

D 

I 

N 

G 

S 

I 

X 

T 

3 

E 

T 

H 

L 

L 

V 

X 

z 

D 

F 

II 

H 

T 

Z 

A 

I 

T 

Y 

D 

Z 

Y 

B 

D 

V 

F 

H 

N 

0 

F 

F 

I 

G 

E 

R 

S 

X 

0 

N 

3 

H 

u 

IJ 

D 

R 

3 

0 

P 

R 

I 

S 

0 

T 

.Z 

D 

F 

K 

Z 

D 

Z 

z 

J 

• S 

X 

I 

S 

G 

Z 

Y 

G 

A 

V 

F 

S 

L 

G 

Z 

N 

E 

R 

S 

\{ 

E 

R 

E 

E 

V 

A 

c 

U 

A 

T 

E 

D 

T 

K 

I 

S 

A 

F 

T 

E 

D 

T' 

H 

E 

T 

C 

D 

z 

R 

s 

V 

T 

Y 

Z 

D 

0 

Z 

F 

F 

H 

T 

Z 

A 

I 

T 

R 

N 

0 

0 

N 

Q 

R 

3 

ii 

A 

I 

N 

D 

3 

R 

L 

E 

S 

S 

0 

11 

E 

H 

U 

N 

Y. 

D- 

Z 

Y 

G 

A . 

V 

D 

G 

z. 

Z 

T 

K 

H 

I 

T 

Y 

Z 

Y 

S 

D 

Z 

G 

H 

U 

.D 

R 

E 

D 

T 

H 

I 

A 

T 

3 

E 

N 

y 

0 

U 

II 

D 

3 

0 

A 

R 

E 

T 

0 

B 

Z- 

F 

Z 

T 

G 

-U 

P 

G 

D 

I 

X 

V 

a 

H 

X 

A 

S 

R 

u 

Z 

D 

F 

U 

I 

D 

E 

S 

E 

N 

T- 

3 

Y 

T 

R 

U 

C 

K 

T 

0 

c 

H 

A 

i 

B 

3 

R 

S 

B 

U 

R 

E 

G 

H 

T 

V 

E 

A 

G 

X 

X 
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Messages AS RESULT OF YESTERDAYS OPERATIONS BY FIRST DIVISION THREE HUN¬ 
DRED SEVENTY NINE r US ONERS CAPTURED INCLUDING SIXTEEN OFFICERS ONE HUN¬ 
DRED PRISONERS VERB EVACUATED THIS AFTERNOON HELiAINDER LESS 01® HUNDRED 
THIRTEEN 7/OUNDED ARE TO BE SENT BY TRUCK TO CHAMBERSBURG TONIGHT 

b. The solution should, as a rule, not be considered complete until 
an attempt has been made to discover all the elements underlying the gen¬ 
eral system and the specific key to a message. In this case, there is no 
need to delve further into the general system, for it is merely one of 
monoalphabetic substitution with a mixed cipher alphabet. It is necessary 
or advisable, however, to reconstruct the cipher alphabet because this may 
give clues that later may become valuable. 

c, . Cipher alphabets should, as a rule, be reconstructed by the 
cryptanalyst in the form of enciphering alphabets because they will then 
be in the form in v/hich the encipherer used them. This is important for 
two reasons. First, if the sequence in the cipher component gives evidence 
of system in its construction or if it yields clues pointing toward its 
derivation from a keyword or a key-phrasa, this may often corroborate the 
identifications already made and may lead directly to additional identifi¬ 
cations. A word or two of explanation is advisable here. For example, 
refer to the skeletonized enciphering alphabet given at the end of par. 29bs 

Plain : ABCDEFGHIJKLMNOPQRSTUV r X Y Z 

Cipher* S Z V TH DGFI 

F G 

Suppose the cryptanalyst, looking at the sequence DGFI or DFGI in the cipher 
component, suspects the presence of a keyword-mixed alphabet. Then DFGI 
is certainly a more plausible sequence than DGFI. Again, noting the se¬ 
quence S.. ,Z...V....TH,,D, he might have an idea that the keyword begins 
after the Z and that the TH is followed by AB or BC. This would mean that 
either P, Q p » A, 3 C or B, C c . Assuming that P, Q p = A, B 0 , he refers to 
the frequency distribution and finds that the assumptions P p = A c and Q p * 

B c are not good; on the other hand, assuming that P, Q p * B, C c , the fre¬ 
quency distribution gives excellent corroboration, A trial of these values 
would materially hasten solution because it is often the case in crypt¬ 
analysis that if the value of a .very low-frequency letter can be surely 
established it will yield clues to other values very quickly. Thus, if Q 
is definitely identified it almost invariably will identify U p , and will p 
give clues to the letter following the U p i since it must be a vowel. In 
the case under discussion the identification PQ p = BC c would have turned 
out to be correct. For the foregoing reason an attempt should always be 
made in the early stages of the analysis to determine, if possible, the 
basis of construction or derivation of the cipher alphabet; as a rule this 
can be done only by means of the enciphering alphabet, and not the decipher¬ 
ing alphabet. For example, the skeletonized deciphering alphabet correspond¬ 
ing to the enciphering alphabet directly above is as follows* 


REF ID:A64644 

-65- 

Cinher. ABCDBFGHIJKL.INOP'jflSTUV'.'XYZ 

Plain - RTSOU AN I Z 

S T 

Here no evidences of a keyword-mixed alphabet are seen at alio However, 
if the enciphering alphabet has been examined and shows no evidences of 
systematic construction, the deciphering alphabet should then be examined 
with this in view, because occasionally it is the deciphering alphabet 
which shows the presence of a key or keying element, or which has been 
systematically derived from a word or phrase. The second reason why it 
is important to try to discover the basis of construction or derivation 
of the cipher alphabet is that it affords clues to the general type of 
keywords or keying elements employed by the enemy. This is a psychological 
factor, of course, and may be of assistance in subsequent studies of his 
traffic. It merely gives a clue to the general type of thinking indulged 
in by certain of his cryptographers. 

d. In the case of the foregoing solution, the complete enciphering 
alphabet is found to be as follo\/s; 

Plain ; A 3 C J 2 7 (i ri I J il L -i II 0 P 1STUV7XYZ 

Cipher- 8 D X H LS ATH30 RTH B 0 fl F fl I J !ii N 

Obviously, the letter Q, which is the only letter not appearing in the 
cryptogram, should follow P in the cipher component. Note now that the 
latter is based upon the keyword USaVJiI ORTH, and that this particular 
cipher alphabet has been composed by shifting the mixed sequence based 
upon this keyword six intervals to the right so that the key for the 
message is A = S c . Note also that the deciphering alphabet fails to give 
any evidsnce P of keyword construction based upon the word LEAVEN 1 ,TOKPH, 

Cipher- A B C i) 2 F S H I J K L m N 0 P ; H 5 T B T / X Y Z 

Plain J IIPQRGSTOUV I ? X J L Y Z II A H B I E C D 3 

e. If neither the enciphering or the deciphering alphabet exhibits 
characteristics which give indication of derivation from a keyword by some 
form of mixing or disarrangement, the latter is nevertheless not finally 
excluded as a possibility. The student is referred to pars. 46 and 47 of 
Special Text No. 165, Elementary i.ilitary Cryptography, wherein will be 
found methods for deriving mixed alphabets by transposition methods ap¬ 
plied to keyword-mixed alphabets. Por the reconstruction of such mixed 
alphabets the cryptanalyst must use ingenuity and a knowledge of the more 
common methods of suppressing the appearance of keywords in the mixed 
alphabet s, 

32. General notes on the foregoing solution. - a. The example 
solved above is admittedly a more or less artificial illustration of the 
steps in analysis, made so in order to demonstrate general principles. 

It was easy to solve because the frequencies of the various cipher let¬ 
ters co'responded quite veil with the normal or expected frecuencies. 
However, all cryptograms of the same monoalphabetical nature can be solved 
along the same general lines, after more or less experimentation, depend¬ 
ing upon the length of the cryptogram, the skill, and the experience of 
the cryptanalyst. 
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b. It is no cause for discouragement if the student's initial at¬ 
tempts to solve a cryptogram of this type require much more time and ef¬ 
fort than were apparently required in solving the foregoing purely il¬ 
lustrative example. It is indeed rarely the case that every assumption 
made by the cryptanalyst proves in the end to have been correct; more 
often is it the case that the majority of his initial assumptions are in¬ 
correct, and that he loses much time in casting out the erroneous ones. 

The speed and facility with which this elimination process is conducted 
is in many cases all that distinguishes the expert from the novice. 

c. Nor will the student always find that the initial classification 
into vowels and consonants can be accomplished as easily and quickly as 
was apparently the case in the illustrative example. The principles in¬ 
dicated are very general in their nature and applicability, and there are, 
in addition, some other principles that may be brought to bear in case of 
difficulty. Of these, perhaps the most useful are the following. 

(1) In nomal English it is unusual to find two or 
three consonants in succession, each of high frequency. 

If in a cryptogram a succession of three or four let¬ 
ters of high-frequency appear in succession, it is 
practically certain that at least one of these repre¬ 
sents a vowel. 

(2) Successions of three vowels are rather unusual 
in English.2 Practically the only time this happens is 
when a word ends in two vowels and the next word begins 
with a vowel. 3 

(3) .lien two letters already classified as vowel- 
equivalents are separated by a sequence of six or more 
letters, it is either the case that one of the supposed 
vowel-equivalents is incorrect, or else that one or more 

_ of the intemediate letters is a vouel-equivalent.^ _ 

Sequences of seven consonants are not impossible, however, as in 
STRENGTH THROUGH. _ 

^ Note that the -rord RADIOED, ?q.st tense of the verb RADIO, is coming into 
usage. _‘_ 

® A sequence of five vowels is not impossible, however, as in YOU AUT HORIZE. 

A Some cryptanalysts place a good deal of emphasis upon this principle as a 
method of locating the remaining vowels after the first two or three have 
been located. They recommend that the latter be underlined throughout the 
text and then all sequences of five or more letters showing no underlines 
be studied attentively. Certain letters which occur in several such se¬ 
quences are sure to be vowels. An arithmetical aid in the study is as fol¬ 
lows: Take a letter thought to be a good possibility as the cipher equiva¬ 
lent of a vowel (hereafter termed a possible vowel-equivalent) and find the 
length of each interval from the possible vowel-equivalent to the next know n 
(fairly surely determined) vowel-equivalent, l.ultiply the interval by the 
number of times this interval is found. Add the products and divide by the 
total number of intervals considered. This will give the mean interval for 
that possible vowel-equivalent. Jo the same for all the other possible vowel- 
equivalents. The one for which the mean is the greatest is most probably a 
vowel-equivalent. Underline this letter throughout the text and repeat the 
process for locating additional vowel-equivalents, if any remain to be located. 
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(4) Reference to Table 7B of Appendix 1 discloses 
the following. 

Distribution of 1st 18 digraphs forming 25/o of English 
text 


No. of consonant-consonant digraphs - 4 
No. of consonantsvo ;we 1 digraphs - 6 
No. of vowel-consonant digraphs - 8 
llo. of vowel-vowel digraphs - 0 


Distribution of 1st 53 digraphs forming 50,4 of English 
text- 


No..of consonant-consonant digraphs - 8 

.No. of consonant-vowel digraphs . - 23 

No. of vowel-consonant digraphs ' - 18 

No. of vowel-vowel digraphs 4 


The latter tabulation shows that of the first 53 di¬ 
graphs which form 50,i of English text, 41 of bhem, 
that is, over 75/*, are combinations of a vowel with a 
consonant. In short, in normal English the vowels and 
the high-frequency consonants are in the long run dis- , 
tributed^ fairly evenly and regularly throughout the 
text. 


( 5 ) As a rule, repetitions of trigraphs in the 
cipher text are composed of high-frequency letters 
forming high-frequency combinations. The latter prac¬ 
tically always contain at least one vowel; in fact, if 
reference is made to Table 13 of Appendix 1, it will be 
noted that 36. of the 56 trigraphs having a frequency 
of 100 or mopre contain one vowel, 17 of them contain 
two vowels, and only three of them contain no vowel• 

In the case of tetragraph repetitions, Table 14 of Ap¬ 
pendix 1 shows that no tetragraph listed therein fails 
to contain at least one vowel, 28 of them contain one 
vowel, 25 contain two vowels, and 2 contain three 
vowels. 

(6) >ite frequently ’'hen two' known vowel-equiva¬ 
lents are separated by six or more letters none of which 
seems to be ’of sufficiently high frequency to repre¬ 
sent one of the .vowels A E I 0, the chances are good 
that the cipher-equivalent of the vowel U or Y is pres¬ 
ent. (See'Fo'ofnote No. 4, page 66) 

(7) The letter Q is invariably followed by U; the 
letters J and V are invariably followed by a vowel. 
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d. In the foregoing example the amount of experimentation or "cutting 
and fitting" was practically nil. (This is not true of real cases as a 
rule.) Jhere such experimentation is necessary, the underscoring of all 
repetitions of several letters is very essential, as it calls attention to 
peculiarities of structure that often yield clues. 

e. After a few basic assumptions of values have been made, if short 
words or skeletons of words do not become manifest, it is necessary to make 
further assumptions for unidentified letters. This is accomplished most 
pften by assuming a word.l Now there are two places in every message which 
lend themselves more readily to successful attack by the assumption of words 
than do any other places, the very beginning and the very end of the message. 
The reason is quite obvious, for although words may begin or end with almost 
any letter of the alphabet, they usually begin and end with but a few very 
common digraphs and trigraphs. In this connection reference should be made 
to Tables 15 and 16 of Appendix 1. Very often the association of letters in 
peculiar combinations will enable the student to note where one word ends 

and the next begins. For example, suppose E, N, S and T have been definitely 
identified, and a sequence like the following is found in a cryptogram. 

- ENTSNE - 

Obviously the break between two words should fall either after the S of 
-3 N T S or after the T of E N T, so that two possibilities are offeredv 

. . . E H T S / H 3 . . ., or . . . 3 N T / S II E . . . . Since in English 

there are very few words with the initial trigraph S N E, it is most likely 

that the proper division is . . . E N T S / N E . . . • Obviously, when 

several word divisions have been found, the solution is rendered more easy 
by virtue of the greater ease with which assumptions of additional new 
values may be made* 

33. The "probable-word" method? its value and applicability. - a. 

In practically all cryptanalytic studies, short-cuts can often be made by 
assuming the presence of certain words in the message under study. Some 
writers attach so much value to this kind of an "attack from the rear" that 
they practically elevate it to the position of a method and call it the 
"intuitive .method" or the "probable-word method". It is, of course, merely 
a refinement of what in every-day language is called "assuning" or "guessing" 
a word in the message. The value of making a "good guess" can hardly be 
overestimated, and the cryptanalyst should never feel that he is accomplish¬ 
ing a solution by an illegitimate subterfuge when he has made a fortunate 
guess leading to solution, a correct assumption as to plain text will often 
save hours or days of labor, and sometimes there is no alternative but to 
try to "guess" a word, for occasionally a system is encountered the solu- 

tion of which is absolutely dependent upon this artifice. _ 

1 This process does not involve anything more mysterious than ordinary, 
logical reasoning? there is nothing of the subnormal or supernormal about 
it. If cryptanalytic success seems to require processes akin to those of 
medieval magic, if "hocus-pocus" is much to the fore, the student should 
begin to look for items that the claimant of such success has carefully 
hidden from view, for the mystification of the uninitiated. (See Par. 33 
in this connection.) 
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b. The expression "gooa guess' 1 is used advisedly. For it is "good!* 
in two respects. First, the cryptanalyst must use care in making his as¬ 
sumptions as to plain-text words. In this he must be guided by extraneous 
circumstances leading to the assumption of probable words - not just any 
words that come to his mind. Therefore he must use his imagination but 

he must nevertheless carefully control it by the exercise of good judg¬ 
ment. Second, only if the "guess" is correct and leads to solution, or. 
at least puts him on the road to solution, is it a good guess* - But, while 
realizing the usefulness and the time and labor-saving features of a solu¬ 
tion by assuming a probable word, the cryptanalyst should exercise discre¬ 
tion in regard to how long ha may continue in his efforts with this method, 
sometimes he may actually waste time by adhering to the method too long, 
if straightforward, methodical analysis will yield results more quickly. 

c. Obviously, the ■'probable-word" method has much more applicability 
when working upon material the general nature of which is known, than when 
working upon more or less isolated communications exchanged between cor¬ 
respondents concerning whom or whose activities nothing is known. For in 
the latter case there is little or nothing that the imagination can seize 
upon as a background or basis for the assumptions. 1 

d. Very frequently, the choice of probable words is aided or limited 
by the number and positions of repeated letters. These repetitions may be 
patent , that is, externally visible in the cryptographic text as it orig¬ 
inally stands, or they may be l atent , that is, externally invisible but 
susceptible of being made patent as a result of -the analysis. For example, 
in a monoalphabetic substitution cipher, such as that discussed in the pre¬ 
ceding paragraph, the repeated letters'are directly exhibited in the crypto¬ 
gram; later the student will encounter many cases in which the repetitions 
are latent, but are made patent by the analytical process. «hen the repeti¬ 
tions are patent, then the pattern or formula to v/hich the repeated letters 
conform is of direct use in assuming plain-text words; and when the-text 

is in word-lengths, the pattern is obviously of oven greater assistance. 
Suppose the cryptanalyst is dealing with military text, in which case he 
nay expect such -words'as DIVISION, BATTALION, etc., to be present in the 
text. The positions of the repeated letter I in DIVISION* of the revers¬ 
ible digraph AT, TA in BATTaLION, and so on, constitute for the experienced 
cryptanalyst, tell-tale indications of the presence of these words, even 
when the text is not divided up into its original word lengths. 

^ General Givierge in his' Cours de Cryptographic (p. 121) says "However, 
expert cryptanalysts often employ such details as are cited above [In con¬ 
nection with assuming the presence of 'probable words'!, and the experience 
of the years 1914 to 1913, to cite only those, prove that in practice one 
often has at his disposal elements of this nature, permitting assumptions 
much'more audacious than those v/hich served for the analysis of the last 
example. The reader would therefore be wrong in imagining that such for¬ 
tuitous elements are encountered only in cryptographic works where the 
author deciphers a document that he himself enciphered. Cryptographic 
correspondence, if it is extensive, and if sufficiently numerous working 
data are at hand, often furnishes elements so complete that an author would 
not dare use all of them in solving a p oblem for fear of being accused of 
obvious exaggeration," 





RW-ID:A64 644 


e. The important aid that a study of word patterns can afford in 
cryptanalysis warrants the use of definite terminology ana the establish¬ 
ment of certain data having a bearing thereon. The phenomenon herein 
under discussion, namely, that many words are of such construction as re¬ 
gards the number and positions of repeated letters as to make them readily 
identifiable, will be termed idi o momhistn (from the Greek "idios" - one's 
own, individual, peculiar + "morp'.-o" - form), words which show this 
phenomena will be termed i dicmnrphi c. It will be useful to deal with the 
idiomorphisms symbolically and systematically as described below. 

f. ‘/hen dealing with cryptograms in which the word lengths are de¬ 
termined or specifically shown, it is convenient to indicate their lengths 
and their repeated letters in somo easily recognized manner or by formulas. 
This is exemplified, in the case of the word DIVISION, by the formula 
ABOBDBEF; in the case of the word BATTALION, by the formula ABCCBDE^G If 
the cryptanalyst, during the course of his studies, makes note of striking 
formulas he has encountered, with the words which fit them, after some time 
he will have assembled a quite valuable body of data. And after more or 
less complete lists of such formulas have been established in some systematic 
arrangement, a rapid comparison of the idiomorphs in a specific cryptogram 
with those in his lists will be feasible and ./ill often lead to the assump¬ 
tion of the correct word. Such lists can be arranged according to word 
length, as shown herewithi 

3/aba ; DID, 3V3, EYE 
abb s ADD, ALL, ILL, OFF, etc. 

4/abac : ARAB, AREA, A /AY, etc. 
abca : BOi.B, DEAD, etc. 
abbc ; . . . 

abcb : ... 

etc. etc. 

g. ihen dealing with cryptographic text in which the lengths, of the 
words are not indicated or otherwise determinable, lists of the foregoing 
nature are not so useful as lists in which the words (or parts of words) 
are arranged according to the intervals between identical letters, in the 
following manner; 

1 Interval . 2 Intervals 3 Intervals Repeated digraphs 

-DiD- AbbAcy AbeyAnce COCOa 

-EvE- ArAbiA hAbitAble dERER 

-EyE- AbiAtive lAborAtory ICICle 

division AboArd AbreAst ININg 

revision -AciA- AbroAd bAGgAGe 

etc. etc. etc. etc. 

In Appendix 2 will be found some useful lists of words arranged in this 
manner. 

34. Solution of additional cryptograms produced by the same cipher 
component. - a. To return, after a rather long digression, to the crypto¬ 
gram solved in pars. 28 - 31, once the cipher component of a cipher alphabet 
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has been reconstruct’d, subsequent messages which have been enciphered by 
means of the same cipher component may be solved very readily, and without 
recourse to the principles of frequency, or application of the "probable- 
word' 1 method. It has been se’.n that the illustrative cryptogram treated 
in paragraphs 24 - 31 was enciphered by .juxtaposing the cipher component 
against the normal sequence so that A p = S c . It is obvious that the cipher 
component may be set a ;ainst the plain component at any one of 26 differ¬ 
ent points of coincidence, each yielding a different cipher alphabet. 

After a cipher component has been reconstructed, however, it becomes a 
known sequence, and the method of converting the cipher letters into their 
plain-component equivalents and then completing the plain-component sequence 
begun by each equivalent can be applied to solve any cryptogram which has 
been enciphered by that cipher component. 

b. An example will serve to make the process quite clear. Suppose 
the following message passing between the same two stations as before was 
intercepted shortly after the first message had been solved- 

I y 3 K C jI A N 7 0 F 0 S 3 LFOOH 3 A Z X X 

It is assumed that the same cipher component was used, but with a differ¬ 
ent key letter. First the initial group or two groups are converted into 
their plain-component equivalents by setting the cipher component against 
the normal sequence at any arbitrary point of coincidence. The initial 
letter of the former may as well be set against A of the latter, with the 
following result. 

Plain - A B C J 3 F S « I J K L i! H 0 P 1 R S T D V 'J X Y Z 

Cipher. L iS A V N '/ 0 R T H 3 C D F 6 I J i; ii P S U X Y Z 

Cryptogram ; I Y 3 '7 K C 13 R II '/ ... 

Jquivalents. rfBFRL3H3F... 

The normal sequence initiated by each of these conversion equivalents is 
now completed, with the results shown in Fig. 15. Ilote the plain-text 
generatrix, CL0S3/0URS, which manifests itself 'ithout further analysis. 

The rest of the message may be read either by continuing the same process, 
or, what is even more simple, the key letter of the message may now be 
determined quite readily and the message deciphered by its means. 
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iyb incnn 
PYBFRLBH3F 
'IZCGSiiGIFG 
RADHTHDJOH 
S BEIU03KHI 
TCFJVPFLIJ 

u j g k :r s ,! j k 

V3HLXRHHKL 
•JFIHY&IOL ii 
XGJHZTJPkU 
YHKOAUKQHO 
2ILPBVLR0P 
a J U '} C V U S P Q 
BKiTRDXHTQR 
*CLOSBYOUAS 
DiiPT. FZ PVST 
2 H ]UGA | ; /TU 
FORVHB RXUV 
G P S W I C S Y V V 
HiTXJDTZ’JX 
IRUYiCEUAXY 
JSVZLFVBYZ 
i; T V A M G M C Z a 
LUXBNHXDAB 
MVYC0IY3BC 
H / ZDPJZFCD 
OXAS'jKAGDS 
• Fig. 15 

c. In order that the student may understand without question just 
what is involved in the latter step, that is, discovering the key letter 
after the first two or three groups have been deciphered by the conver¬ 
sion-completion process, the foregoing example ’Till be used. It was noted 
that the first cipher group was finally deciphered as follows. 

Cipher. I Y E V K 
Plain : GLOSS 

Now set the cipher component against the normal sequence so that C p ■ I c . 
Thus; 

Plain : ABODE FGHIJICLH NOP "JRSTUV / X t Z 
Cipher: F G I J K k P S U X Y Z L i A V N '/ ORTH BCD 

It is seen here that when C p = I c then A p = F c . This is the key for the 

entire message. The decipherment may be completed by direct reference to 
the foregoing cipher alphabet. Thuss 

Cipher: I Y 3 7 K C 3 R N 7 0F0S3 LFOOH 3AZXX 

Plain : CLOSE YOURS T a T I 0 II A T T 7 0 P li X X 


lies sage. CLOSE YOUR STATION AT T 10 HI 
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d,. The student should make sure that he understands the fundamental 
principles involved in this quick solution, for they are among the most 
important principles in cryptanalytics. How useful they are will become 
clear as he progresses into more and more complex cryptanalytic studies', 

SECTION VII 

POLYLIT IRaL SUBSTITUTIuh WITH I'OHO-E}UIVaL 3:!T CliHE-l ALPHABETS 

Paragraph 


Analysis of polyliteral, monoalphabetic substitution systems. . . 35 

Historically interesting examples . 36 


35. Analysis of polyliteral, .jonoalphabetic substitution systems. - 

a. ^substitution methods in general may be classified into monoliteral 
and polvliteral systems.In the former there is a strict "one-to-one" 
correspondence between the length of the units of the plain and those of 
the cipher text; that is, 9ach letter of the plain text is replaced by a 
single character in the cipher text. In the latter this correspondence 
is no longer lp.l c but may be lp.2 c , where each letter of the plain text 
is replaced by a combination of two characters in the cipher text; or 
1 -3 C , where a 3-character combination in the cipher text represents a 
single letter of the plain text, and so on. a cipher in which the corre¬ 
spondence is of the Ip*l c type is termed monoliteral in character; one in 
which it is of the l p J2 c type, biliteral; l p 3 q» triliteral, and so on. 

Those beyond the lpJi c type are class.ed together as polvliteral . 

b. When a polyliteral system employs biliteral emivalents, the cipher 
alphabet is said to be bipartite. Such alphabets are composed of a set of 
25 or 26 combinations of a limited number of characters taken in pairs, An 
example of such an alphabet is the following. * 

Plain. aB C B E F G H I JKLliN 0 P'JRS.T U V "/ X Y 
Cipher. -VH IC JT G HE HH HI HT HT HE I •/ IH II IT IE TW TH TI TT TE SI EH El ET 

This alphabet is derived from the square shown in Fig. 16. 

( 2 ) 

WHITE 

¥ 

H 

U) I 
T 


c. If a message is enciphered by means of the foregoing bipartite 
alphabet the cryptogram is still monoalphabetic in character, a frequency 
distribution based upon pairs of letters -'ill obviously have all the 


See Par. 29, Special Text Wo. 166, Advanced -Military Cryptography. 


A 

B 

C 

D 

3 

F 

G 

H 

I-J 

K 

L 

M 

N 

0 

P 

_Q_ 

R 

S 

T 

U 

V 

>V 

X 

Y 

z 


Fig. 16 
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characteristics of a simple, raonoliteral distribution for a i.onoalphabetic 
substitution cipher. 

d. Ciphers of this type, as well as of those of the triliteral, tetra- 
literal, ... type are readily detected externally by virtue of the fact that 
the cryptographic text is composed of but a very limited number of differ¬ 
ent characters. They are handled in exactly the sane manner as are mono¬ 
literal, monoalphabetic substitution ciphers. So long as the same character, 
'or combination of characters is always used to represent the same plain-text 
letter, and so long as a given letter of the plain text is always represented 
by the same character or combination of characters, the substitution is 
strictly monoalphabetic and can be handled in the simple manner described 
under Par. 31 of this text. 

e. nn interesting example in which the cipher equivalents are penta- 
.literal groups and yet the resulting cipher is strictly monoalphabetic in 
character is found in th9 cipher system invented by Sir Francis Bacon over 
300 years ago. Jespite its antiquity the system possesses certain features 
of merit which are well worth noting. Jacon^ proposed the following cipher 
alphabet, composed of permutations of two elements taken five at a timei 


A 

= 

aaaaa 

G 

= 

aabba 

N 

= 

abbaa 

T 

= 

baaba 

B 

= 

aaaab 

H 

= 

aabbb 

0 

= 

abbab 

U-V 

a 

baabb 

C 

= 

aaaba 

I-J 

= 

abaaa 

F 

= 

abbba 

W 

a 

babaa 

D 

= 

aaabb 

K 

= 

abaab 

0 

a 

abbbb 

X 

a 

babab 

E 

= 

aabaa 

L 

= 

ababa 

R 

= 

baaaa 

Y 

= 

babba 

F 

= 

aabab 

U 

= 

ababb 

S 

= 

baaab 

Z 

a 

babbb 


If this were all there were to Bacon’s invention it vould be hardly worth 

bringing to attention. But what he pointed out, with great clarity and 

simple examples, was how such an alphabet might be used to convey a secret 
message by enfolding it in an innocent, external message which might easily 
evade the strictest kind of censorship. As a very crude example, suppose 
that a message is written in capital and lower case letters, any capital 
letter standing for an "a 1 * element of the cipher alphabet, and any small 
letter, for a "b" element. Then the external sentence “All is well with 
me today" can be made to contain the secret message "Help". Thus: 

A L 1 is V J! 1 L ItH m3 

a a b^ b b a. a b a a ^a b a b a_ 

H E L 


T o d a Y 
a b b b a 



1 For a true picture of this cipher, the explanation of which is often dis¬ 
torted beyond recognition even by cryptographers, see Bacon's own descrip¬ 
tion of it as contained in his De Augme nt is Scientiarum (The .advancement of 
Learning) . as translated by any first-class editor, such as Gilbert .*atts 
(1640) or Ellis, opedding, and Heath (1857, 1870). The student is cautioned, 
however, not to accept as true any alleged "decipherments" obtained by the 
application of Bacon's cipher to literary works of the 16th century. These 

readings are purely subjective. _;.. 

^ In the 16th Century, the letters I and J were used interchangeably, as 
were also U and V. 
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Instead of employing such an obvious device as capital and small letters, 
suppose that an "a" will be indicat jd by a very slight shading, or a very 
slightly heavier stroke - Then a secret message might easily be thus en¬ 
folded within an external message of exactly opposite meaning. The number 
of possible variations of this basic scheme is v3ry high. The fact that 
the characters of the cryptographic text are hidden in some manner or other 
has, however, no effect upon ohe strict monoalphabeticity of the scheme. 

36. Historically interesting examples. - a. Two examples of histor¬ 
ical interest will be cited in this connection as illustrations. Juring 
the campaign for the presidential election of 1376 many cipher messages 
were exchanged between the Tilden managers and their agents in several 
states where the. voting was hotly contested. Two years later the New for.: 
Tribune 1 exposed many irregularities in the campaign by publishing the de¬ 
cipherments of many of these messages. These decipherments were achieved 
by tv/d investigators employed by the Tribune, and the plain text of the 
messages seems to show tHat illegal attempts and measures to carry the 
election for Tilden were made by his managers. Here is one of the messages. 

Jacksonville!, iiov. 16 (1876). 


SJO. 7. RANHY, Tallahassee. 

Ppyyemnshyyypimashnsyyssitepaaensh 
nsseusshnsmmpiyysnppyeaapieisdyeshai 
ns.sspeeiyyshnyns ss yepiaanyit ns6hyysp 
yypinsyyssiteraeipimmeisseiyyeissitel 
epyypeeiaassimaayespnsyyianssseissmm 
ppns pins s npinsiraimyyit emyys s peyymmns 
yyssitspyypeepppmaaayypiit 
L'jingle goes up tomorrow. 

J1.NI3L. 

Hxamination of the message discloses that only ten different letters 
are used. It is probable, therefore, that what one has here is a cipher 
which employs a bipartite alphabet and in which combinations of two let¬ 
ters represent single letters of the plaintext. The message is therefore 
rewritten in pairs and substitution of arbitrary letters for the pairs is 
made, as seen below: 

PH YY £ 1 NS HY' YY PI "iIA SH NS YY S3 etc. 

A B C JJI'B F’GH D 3 I etc. 

1 New York Tribune - -Cxtra No. 44’- "The Cipher Dispatches" - New York, 
1879. 
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A triliteral-frequency distribution is then made and analysis of the 
table along the lines illustrated in the preceding section of this text 
yields solution, as follows: 


JACKSONVILLE, Nov. 16. 


GEO. F. FLiNS'f, Tallahassees 

Have Marble and Coyle telegraph for influential men from Delaware 
and Virginia. Indications of weakening here. Press advantage and watch 
Board. L 1 Engle goes up tomorrow. 

DANIEL 

b. The other example, using numbers, is as follows: 

JACKSONVILLE, Nov. 17. 

S. PASCO and E. Li. L'ENGLE. 


84 

55 

84 

25 

93 

34 

82 

31 

31 

75 

93 

82 

77 

33 

55 

52 

93 

20 

90 

66 

77 

65 

33 

84 

63 

31 

31 

93 

20 

82 

33 

66 

52 

48 

44 

55 

42- 

82 

48 

89 

42 

93 

31 

82 

66 

75 

31 

93 


DANIEL. 


There were, of course, several messages of like nature, and examina¬ 
tion disclosed that only 26 different numbers in all were used. Solution 
of these ciphers followed very easily, the decipherment of the one given 
above being as follows- 


JACKSONVILLE, Nov. 17. 


S. PASCO and E. M. L'ENGLE: 

Cocke will be ignored, Eagan called in. Authority reliable. 

DANIEL. 

c. The Tribune experts gave the following alphabets as the result 
of their decipherments: 


AA 

= 

0 

EN 


i 

IT 

a 

D 

NS 

a 

E 

PP 

a 

H 

SS 

a 

N 

AI 

r 

u 

EP 

- 

C 

MA 

a 

B 

NY 

a 

H 

SH 

a 

L 

YE 

a 

F 

El 

<= 

I 

IA 

- 

K 

MM 

a 

G 

PS 

a 

T 

SU¬ 

a 

P 

YI 

a 

X 

ELI 


V 

Iii 

Jt 

S 

UN 

= 

J 

PI 

r 

R 

SP 

= 

\\ 

YY 

= 

A 

20 

s 

D 

33 

a 

N 

44 

'= 

H 

62 

a 

X 

77 

"a 

G 

89 

V 

Y 

25 


K 

34 

= 

V 

48 

= 

T 

66 

a 

A 

82 

a 

I 

93 

a 

E 

27 

* 

S 

39 

= 

P 

52 

a 

U 

68 

a 

F 

84 

a 

C 

96 

a 

M 

31 

= 

L 

42 

« 

it 

55 

a 

0 

75 

a 

B 

87 

* 

V 

99 

= 

J 
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They did not attempt to correlate these alphabets, or at least they say 
nothing about a possible relationship. The present author has, howe^r, 
reconstructed the rectangle upon which these alphabets are based,’ and it 
is given herewith* 


H 1 
I 2 
S 3 
P 4 
A 5 
Y 6 
M 7 
3 8 
N 9 
T 0 


It is amusing, to note that the conspirators selected as their key a phrase 
quite in keeping with, their attempted illegalities; HIS PAY.L3NT; for 
bribery seems to have played a considerable part in that campaign. The 
blank squares in the diagram probably contained proper names, numbers, etc. 

SECTION VIII 

POLTLITJSRAL SUBSTITUTION WITH POLY-EQUIVALENT CIPHER ALPHABETS. 


rurpose of providing poly-equivalent cipher alphabets ..... ,^ ra S^ph 

Solution of a'simple example .. :. .... i .... * 38 

Solution of a more complicated example ..39 

a. subterfuge to prevent decomposition of cipher text into 

component units.... 40 


37-. Purpose of providing poly-equivalent cipher alphabets. - a. It 
has been seen that the characteristic frequencies of letters composing 
normal plain text, the associations they form in combining to fora words, 
and the peculiarities certain of them manifest in such text all afford 
direct clues by means of which ordinary mOnoalphabetio substitution encipher 
raents of such plain text may be more or less speedily solved. This has led 
to the introduction of simple methods for disguising or suppressing the 
manifestations of monoalphabeticity, so far as possible. Basically these 
methods are polyliteral and they will now be presented. 


2d Letter 
or 

Number 


HIS PAYMENT 
1234567890 
















I 





D. 

~_L 


N_ 

W 





7 



■r 


H 




T 



I 

u’ 



I! 







X. 




A_ 


7 







E 


oT 





T 


<r 



X 


Y 




3 



M 



£ 











□ 
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b. Polyliteral substitution may be of two types: 

(1) That wherein each letter of the plain text is 
represented by one and only one polyliteral equivalent. 

For example, in the Francis Bacon cipher described in 
Par. 35 e, the letter Kp is invariably represented by 
the permutation ab aab . For this reason this type of 
system may be more completely described as monoalphabetic, 
polyliteral substitution with mono-equivale nt cipher 
alphabets . 

(2) That wherein, because of the large number of 
equivalents made available by the combinations and per¬ 
mutations of a limited number of elements, each letter 
of the plain text may be Represented by several poly- 
literal equivalents which may be selected at random. 

For example, if 3-1'etter combinations are emp?.oyed there 
are available 26^ or 17,576 equivalents for the 26 letters 
of the plain text; they may be assigned in equal numbers 
of different equivalents for the 26 letters, in which 
case each letter would be representable by 676 different 
3-letter equivalents; or they may be assigned on some 
other basis, for example, proportionately to the relative 
frequencies of plain-t-ext letters. For this reason this 
type of system may be more completely described as mono - 
alphabetic . polvliteral substituti on with poly-equivalent 
cipher alp h abe ts. Some authors term such a system 
“simple substjtution with multiple.equivalents"; others 
term it mo malphabetic substitution with v ariants . For 
the sake of brevity, the latter designation will be 
employed in this text. 

c. The primary object of monoalphabetic substitution with variants 
is, as has been mentioned above, to provide several values which may be 
employed at random in a. simple substitution of cipher equivalents for 
the plain-text letters. In this connection, reference is made to section 
X of Special Text 165, Elementary Military Cryptography, wherein several 
of the most common methods for producing and using variants are set forth. 

d. A word or two concerning the underlying theory from the cryptana- 
lyt'ic point of view of monoalphabetic substitution with variants, may not 
be amiss, 'Yhereas in simple or mono-equivalent, monoalphabetic substitu¬ 
tion- it is seen that: 

(1) “ The same letter of the plain text is invariably 

, represented by but one and always the same character of 

the cryptogram, and 

(2) The same .character of the cryptogram invariably 
represents one and always the same letter of the plain text; 


REF ID:A64644 


- 79 - 

in monoalphabetic substitution with variants; 

(1) The same letter of the plain text may be 
represented by one or more different characters of 
the cryptogram, but 

(2) The same character of the cryptogram 
nevertheless invariably represents one arid always 
the same letter of the plain text. 

38. Solution of a simple example. - a. The following cryptogram 
has been enciphered by the method explained in Par. 52 b of Special 
Text No. 165, 31ementary Military Cryptography, and the steps in solu¬ 
tion will now' be scrutinized. 


CRYPTOGRAM 


68321 

09022 

48057 

65111 

8864‘8 

42036 

45235 

09144 

05764 

22684' 

00225 

57003 

97357 

14074 

8*1524 

40768 

51058 

93074 

92188 

47264 

09328 

04255 

06186 

79882 

85144 

45886 

32574 

55136 

56019 

45722 

76844 

68350 

45219 

71649 

90528 

65106 

11886 

44044 

89669 

70553 

18491 

06985 ‘ 

48579 

33684 

50957 

70612 

09795 

29148 

56109 

08546. 

62062 

65509 

32800 

32568 

97216 

44282 

34031 

84989 

68564 

53789 

12530 

77401 

68494 

38544 

11368 

87616 

56905 

20710 

58864 

67472 

22490 

09136 

62851 

24551 

35180 

14230 

50886 

44084 

06231 

12876 

05579 

58980 

29503 

99713 

32720 

36433 

82689 

04516 

52263 

21175 

06445 

72255 

68951 

86957 

76095 

67215 

53049 

08567 

9730 



b. Assuming that the foregoing remarks had not been made and that 
the cryptogram has just been submitted for solution with no information ’ 
concerning it, the first step is to make a preliminary study to determine 
whether the cryptogram involves cipher or code. The cryptogram appears 
in 5-figure,groups, which may indicate either cipher or code. A few re¬ 
marks will' be made at this point with reference to the method of determin¬ 
ing whether a" cryptogram composed of figure groups' is in code or cipher, 
using the foregoing example. 

c. In the first place, if the cryptogram contains an even number 
of digits, as for example 330 in the foregoing message, this leaves open 
the possibility that it may bo cipher, composed of 165 pairs of digits; 
were the number of digits an exact odd multiple of five, such as 125, 

135, etc., the possibility that the cryptogram is in code of tho 5-figure 
group type must bo considered. Noxt, a preliminary study is made to see 
if there are many repetitions, and what their characteristics are. If 
the cryptogram is code of tho 5-figure group typo, then such repetitions 
as appear should gen er ally be in whole groups of fivo digits, and they 
should bo visible in tho text just as tho mossago stands, unless thp code 
message has undergone encipherment also. If the cryptogram is in cipher, 
then tho repetitions should oxtend beyond tho 5-digit groupings; if they 
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conform to any definite groupings at all they should for the most part 
contain oven numbers of digits since each letter is probably represented 
by a pair of digits. If no clues of the foregoing nature arc present, 
doubts will be dissolved by making a detailed study of frequencies. 

d. A simple 4-part frequency distribution is therefore decided upon. 
Shall the alphabet bo assumed to be a 25- or a 26- character one? If the 
former, then the 2-digit pairs from 01 to 00 fall into exactly four groups 
each corresponding to an alphabet. Since this is tho most common scheme 
of drawing up such alphabets, lot it bo assumed to be true of tho present 


case. The following distributions 
toxt into 2-digit pairs. 

result from the breaking up of the 

01 -/// 

26 -/// 

51 -fUJ 

76 -/UJ! 

02 - 

27 - 

52 -/UJ 

77 -/ 

03 -//// 

28 -/ 

53 -/// 

78 - 

04 -/ 

29 -/ 

54 - 

79 -/ 

05 -NJ 

30 -/// 

55 -//// 

80 -/// 

oe -mil 

31 - 

56 -fHj 

81 - 

07 -/// 

32 -/HJJ 

57 -tUJ/ 

82 - //// 

03 - 

33 -/ 

58 -// 

83 -/ 

09 -//// 

34 -/ 

59 - 

84 -{H/t 

10 -//// 

35 -// 

60 - 

85 -/HJ( 

li -/HJ 

36 -M/ 

61 

86 -Iff 

12 -/// 

37 - / 

62 -// 

87 - 

13 -/ 

38 - 

63 - 

88 -III! 

14 -/ 

39 - / 

64 -{HI! 

89 -{HI 

15 -/ 

40 -/// 

65 - 

90 -/HI / 

16 -III 

41 - 

66 - / 

91 - Iff 

17 - 

42 -//// 

67 -// 

92 -/ 

18 -MJi 

43 -/ 

68 - {HJ n 

93 -/ 

19 - 

44 -/HJ! 

69 -// 

94 -/ 

20 -/ 

45 -/W! 

70 - / 

95 -/// 

21 -// 

46 -/// 

71 -/ 

96 - 

22 -/HJ 

47 - 

72 - HH 

97 - /ft/ / 

23 -// 

48 -/// 

73 - 

98 - / 

24 - 

49 -//// 

74 -//// 

99 - 

25 -/ 

50 -HU 

75 -/ 

00 -// 


Fig. 16 

e. If the student will bring to bear upon this problem tho principles 
ho learned in Section V of this toxt, ho will soon roalizo that what he 
now has boforo him are four, simple, monoalphabotic frequency distributions 
similar to those involved in a monoalphabotic substitution cipher using 
standard cipher alphabets. The realization of this fact immediately pro¬ 
vides tho clue to tho next stop: "fitting each of the distributions to 
the normal" (Soo Par. 17 b). This can be done without difficulty in this 
caso (remembering that a 25-lottor alphabet is involvod and assuming that 
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I and J aro tho sam.o letter) and tho following alphabets result: 


01 


I-J 

26 

_ 

u 

51 


N 

- 76 

_ 

E 

f. Tho keyword.is soon to 

02 

_ 

K 

27 

- 

V 

52 

- 

0- 

• -77 

- 

F 

bo JUNE and tho first fow groups 

03 

_ 

L 

28 

_ 

w • 

53' 


P 

78 

- 

G 

of tho cryptogram dociphor as 

04 

_ 

M 

29 


X 

54 

- 

Q 

79 

- 

H 

follows: 

05 

_ 

N 

30 

_ 

Y 

55 

_ 

R 

80 

- 

I-J 


06- 

i. 

0 

31 


Z 

56 

_ 

S 

81 

_ 

K . 

• 68 32 10 90 22 48 05 76 51 

07 

_ 

P 

32 

_ 

A 

57 

. 

T 

82 

- 

L 

SAST.JSItNEN 

08 

_ 

Q 

33 

_ 

B 

58 

_ 

U 

83 

_ 

M 


09 

_ 

R 

34 

- 

C 

59 

_ 

V 

84 

_ 

N 

11 88 64 84 20 36 45 23 

10 

_ 

S 

35 

_ 

D 

60 

- 

W 

85 

_ 

0 

TRANGEOF 

11 

_ 

T 

36 

- 

E 

61 

_ 

X 

86 

_ 

P 


12 

_ 

U 

37 

- 

F 

62 

_ 

Y 

87 

- 

Q 

g. From tho dotailod pro¬ 

13 

- 

V 

38 

. 

G 

63' 

- 

Z 

88 

- 

R 

cedure given abovo, tho studont 

14 

_ 


39 

_ 

H 

64 

_ 

A 

89 

- 

S 

should bo ablo to draw his own 

15 

_ 

X 

40 

_ 

I-J 

65 

_ 

B 

90 

_ 

T 

conclusions as to tho procoduro to 

16 

_ 

Y 

41 

- 

K 

66 

- 

G 

91 

- 

U 

bo followed in solving cryptograms 

17 

- 

Z 

42 

- 

L 

67 

- 

0 

92 

- 

V 

producod by mothods which aro moro 

18 

- 

A 

43 

_ 

M 

68 

- 

E 

93 

- 

W 

or loss simple variations of that 

19 

_ 

B 

44 

- 

N 

69 

- 

F 

94 

_ 

X 

just discussod. In this connection 

20 

- 

C 

45 

- 

0 

70 

_ 

G ' 

95 

- 

Y ■ 

ho is roforrod to Par. 53 of Spocial 

21 

_ 

D 

46 

- 

P 

71 

- 

H 

96 

- 

Z 

Toxt No. 165, Elomontary Military 

22 

- 

S 

47 

- 

Q 

72 

. 

I-J 

97 

-■ 

A . 

Cryptography, vhoroin a fow of 

23 

- 

F ' 

48 

- 

R 

73 

- 

K 

98 

- 

b r 

thoso variations arc montionod. 

24 

- 

G 

49 

- 

S 

74 

- 

L 

99 

_ 

C 


25 

- 

H 

50 

- 

T 

75 

- 

M 

00 

- 

D 

39. Solution of a moro com- 


Fig. 17 plicatod cxamplo. - a. As soon as 

a boginnor in cryptography roalizos 

tho consoquoncos of tho fact that letters aro usod with groatly varying 
f'roquoncios in normal plain toxt, what sooms to him as a now idoa very 
spoodily comos to him. Why not disguise tho natural froquoncics by a systom 
of substitution using many oquivalonts, and lot tho numbors of oquivalonts 
assignod to tho various lottors bo moro or loss in direct proportion to 
tho normal froquoncios of tho lottors? Lot E, for example, have 13 or 
moro oquivalonts; T, 10; N, 9; otc., and thus (ho thinks) the onomy cryp¬ 
tanalyst can have nothing in tho way of tcll-talo or characteristic fro¬ 
quoncios to uso as an ontoring wodgo. 

b. If tho toxt available for study is small in amount and if tho 
variant values aro wholly-indopondont of ono another, tho problom can be¬ 
come oxcoodingly difficult. But in practical military communications such 
raothods aro raroly oncountorod, bocauso tho volume of toxt is usually groat 
enough to pormit of tho ostablishmont of equivalent . valuos . To illustrate 
what is mount, suppose a sot of cryptograms produced by tho monoulphabotic- 
vnriunt method described above shows tho following two sots of groupings in 
tho toxt: - . 
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Sot A 


12-3'7-02-7 9-68-13-03-37-77 
82-69-03-79-13-68-23-37-35 
82-69-51-16-13-13-78-05-35 
91-05-02-01-68-42-78-37-77 


71-12-02-51-23-05-77 

11-82-51-02-03-05-35 

11-91-02-02-23-37-35 

97-12-51-03-78-69-77 


An examination of those groupings would load to the following tentative 
conclusions with regard to probable equivalents! 


12, 82, 91 01, 16, 79 03, 23, 78 
05, 37, 69 13, 42, 68 35, and 77 
02, and 51 


The establishment of those equivalencies would sooner or lator load to 
the finding of additional sots of equal values. The completeness with 
which this can bo accomplished will determine tho oaso or difficulty of 
solution. Of course, if many equivalencies can be established the problem 
can then bo reduced practically to monoalphabotic terms and a speedy solu¬ 
tion can bo attained. 


c. Theoretically, the determination of oquivalencios .may seem to bo 
quite an easy matter, but practically it may be very difficult, because 
the cryptanalyst can never be certain that a combination showing what may 
appear to be a variant value is really such, and is not a different word. 
F«r oxamplo, take the groups 

17-82-31-82-14-63, and 
27-82-40-82-14-63 


Hero one might suspect that 17 and 27 represent the same lottor, 31 and 
40 another letter. But it happens that ono group roprosonts tho word 
MANAGE, tho other DAMAGE. 

d. When reversible combinations are used as variants, the problem 
is perhaps a bit moro simple. For oxamplo, using tho accompanying Fig. 18 
for encipherment, two messages with tho same initial words, REFERENCE YOUR, 
may be enciphered as follows: 



K,Z 

Q,v 

B,H 

M,R 

D,L 

\7,S 

N 

H 

A 

0 

E 

F,X 

D 

T 

M 

F 

P 

G, J 

Q 

B 

U 

I 

V 

C,N 

G 

X 

R 

G 

B 

P,T 

Z 

L 

Y 

W 

K 


Fig. 18 
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REFER 3 N C _ E Y 0 U _R 

(1) N H W D R X L S H C D’TITZtl R S L H P S R B J C H 

(2) CH37R X S L H N D './ Z W N R L S H P R \1 J B N H 

The experienced cryptanalyst, noting-the appearance of the very first 
few groups, assumes that ho. is fcftre confronted with a case involving 
biliteral reversible equivalents, with variants. 

e. The probable-w^rd method of solution may be used, but with a 
slight variation introduced by virtue of t„he fact that, regardless of 
the system, letters of low frequency in plain text remain infrequent . 

Hence, suppose a word containing low-frequency letters, but in itself 
a rather common word strikingly idiomorphic in character is sought as 
a "probable word"; for example, words such as CAVALRY, ATTACK, and 
PREPARE. 'Yriting such a word on a slip of paper, it is slid one interval 
at a time under the text, which has been marked so that the high and low- 
frequency characters are indicated. Each coincidence of a low-frequency 
letter of the text with a low-frequency letter of the assumed word is ex¬ 
amined carefully to see whether the adjacent text letters correspond in 
frequency with the other letters of the assumed word; or, if the latter 
presents repetitions, whether there are correspondences between repeti¬ 
tions in the text and those in the word. Many trials are necessary but 
this method will produce results when the difficulties are otherwise too 
much for the cryptanalyst to overcome. 

40. A subterfuge to prevent decomposition of cipher text into com¬ 
ponent units. - a. A few words should be added with regard to certain 
subterfuges which are sometimes encountered in monoalphabetic substitution 
with variants, and which, if not recognized in time, cause considerable 
delays. Those have to deal v/ith tho insertion of nulls so as to prevent 
the cryptanalyst from breaking up tho text into its real cryptographic 
units. The student should take caroful note of tho last phraso; the moro 
insertion of symbols having tho same characteristics as the symbols of tho 
cryptographic toxt, except that they have no moaning, is not what is meant. 
This class of nulls rarely achieves tho purpose for which they aro intended, 
"ilhat is really meant can best be explained in connection with an example. 
Suppose that a 5 x 5 checkerboard design with tho row and indicators shown 
in Fig. 19 i s adopted for encipherment. Normally, tho cipher units would 
consist of 2 -lottor combinations of tho indicators, invariably giving the 
row indicator first (by agreement). 
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v g i vr d 

A H P S M 

T 0 E ■ B N 

F U R L C 

V,A,T,F 
G,H,0,U 
I,P,E,R 
V7,S,B,L 
0 ,M ,'N, C 


The phrase COMMANDER OF SPECIAL TROOPS might be enciphered thus: 

C0MMAND3R0F 
VI EB PH IU FT IE AB Tli ’70 K7 GT 


A 

B 

C 

D 

E 

F 

G 

H 

I-J 

K 

L 

M 

N 

0 

P 

Q 

R 

S 

T 

U 

V 

W 

X 

Y 

Z 


Fig. 19 


Those would normally then be arranged in 5-lotter groups, thus: 

VIEBP HIUFT I E A B T M ’7 0 P 7 GT... 

b. It will be noted, however, that only 20 of tho 26 lotters of the 
alphabet have been employed as row and column indicators, leaving J, K, 

3, X, Y, and Z unused. Now suppose those five letters are used as nulls, 
not in pairs , but as individual letters inserted at random just before 
the real text is arranged in 5-lettor groups. Occasionally, a pair of 
nulls is inserted. Thus, for oxamplo: 

VIEXB PHKIU FJXTI EAJBT M \7 0 Q P V/GKTY 

The cryptanalyst, after somo study, suspecting a biliteral cipher, pro¬ 
ceeds to break up tho text into pairs: 

VI EX BP HK IU FJ XT IE AJ BT 1W OQ GK TY 

Compare this set of 2-lottor combinations with tho correct sot. Only 3 
of the 15 pairs are "proper" units. It is easy to soo that without a 
knowledge of the oxistcnco of tho nulls, and even with a knowledge, if ho 
does not know which letters are nulls, tho cryptanalyst would bo confronted 
with a quito difficult problem, for tho solution of which a vory large 
amount of text might be necessary. Tho careful employment of the variants 
also very materially adds to the security of tho mothod bocauso repeti¬ 
tions can be rather effectively suppressed. 
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; ,£» 'From the cryptographic standpoint, the fact that in this system 
the cryptographic text is more than twice as long as the plain text con¬ 
stitutes ,4 serious disadvantage. From the cryptanalytic standpoint, the 
masking of the cipher units constitutes the most important source of 
strength of the system; this, coupled with the use of variants, makes it 
a quite difficult system to solve, despite its monoalphabeticity. 



SECTION IX 

.POLYGRAFHIC SUBSTITUTION SYSTEMS 

Paragraph 


Monographic and polygraphic substitution systems . 41 

Tests for identifying digraphic substitution . 42 

General procedure in the analysis of digraphic substitution ciphers 43 
Analysis of digraphic substitution ciphers based upon 

4-square checkerboard designs . . . 44 

Analysis of ciphers based upon other types of checkerboard designs 45 
Analysis of the Playfair cipher system...46 


41. Monographic and polygraphic substitution systems. - a. The 
student J is now referred to Sections VII and VIII of Special Text No. 166, 
Advanced Military Cryptography, wherein polygraphic systems of substitu¬ 
tion are discussed from the cryptographic point of view. These will now 
be'discussed from the cryptanalytic point of view. 

__b. Although "the essential differences between polyliteral and poly¬ 
graphic substitution are treated with some detail in Pars. 29 and 30 of 
Special Text No. 166, a few aaditional words on the subject may not be 
amiss at this point. 

_.c.. The two primary divisions of substitution systems into (1) mono¬ 
literal and polyliteral methods and into (2) monographic and polygraphic 
methods are both based upon considerations as to the number of elements 
-constituting the plain-text and the equivalent cipher-text units. In 
monoliteral as well as in monographic substitution, each plain-text unit 
ccfnsists’of a single element and each cipher-text unit consists of a 
single element. The two terms monoliteral and monographic are therefore 
identical in significance, as defined cryptographically. It is when the 
terms polyliteral and polygraphic are examined that an essential difference 
is seen. In polyliteral substitution the plain-text unit always consists 
of a single element (one lettor) and the cipher-text unit consists of a 
group Of two or more elements; when bilitoral, it is a pair of elqments, 
when triliteral, it is a set of three elements, and sc on. In what will 
herein bo designated as true or complete polygraphic substitution the 
plain-text unit consists of two or more elements forming an indivisible 
compound s the ciphor-text unit usually consists of a corresponding number 
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of elements. 1 When the number of elements comprising the plain-text 
units is fixed and always two, the system is digraphic t whon it is always 
three, the system is trigraphic , and so on.** It is impertant to note 
that in true or complete polygraphic substitution the elements combine 
to form indivisible compounds having properties difforont from thoso of 
either of the constituent letters. For example, in monoliteral substitu¬ 
tion ABp may yield XY C and AC c may yield XZ C ; but in true digraphic sub¬ 
stitution 3Ep may yield XY C and SUp may yield £JN C . A difference in 
identity of one letter affects the whole result. An analogy is found 
in chemistry, when two elements combine to form a molecule, the latter 
usually having properties quite different from those of either of the 
constituent elements. For examples sodium, a metal, and chlorine, a 
gas, combine to form sodium chloride, common table salt. Furthermore, 
sodium and fluorine, also a gas similar in many respects to chlorine, 
combine to form sodium fluoride, which is much different from table salt. 
Partial and pseudo-polygraphic substitution will be treated under sub- 
paragraphs d and e below. 

d. Another way of looking at polygraphic substitution is to regard 
the elements comprising the plain-text units as being enciphered indi¬ 
vidually and polyalphabetically by a fairly large number of separate 
alphabets. For example, in a digraphic system in which 676 pairs of plain¬ 
text letters are representable by 676 cipher-text pairs assigned at random, 
this is equivalent to having a sot of 26 different alphabets for encipher¬ 
ing one member of the pairs, and another set of 26 different alphabets for 
enciphering tho other member of the pairs. According to this viewpoint 
the different alphabets are brought into play by the particular combination 
of letters forming each plain-text pair. This is, of course, quite differ¬ 
ent from systems whorein the various alphnbots are brought into play by 
more definite rules; it is perhaps this vory absence of definite rules 
guiding the selection of alphabets which constitutes the cryptographic 
strength of this typo of polygraphic system. 

e. ’.Then regarded in t.ho light of the preceding remarks certain 
systems which at first glance seem to be polygraphic, in that groupings 
of plain-text letters are troatod as units, on closer inspection are seen 


The qualifying adverb "usually” is employed because this correspondence 
is not essential. For example, if one should draw up a set of 676 arbi¬ 
trary single signs, - it would bo possible to represent the 2-lettor pairs 
from AA to ZZ by single symbols. This would still be a digraphic system. 

** In this sense a code system is merely a polygraphic substitution system 
in which tho number of elements constituting the plain-text units is vari¬ 
able. 

^ For this reason the two letters are marked by a ligature, that is, by a 
bar across their tops. 


REF ID:A64644 


. “- 87 

$o be^only partially polygraphia, or pseudc-polygraphic in character, 
for example,’ in a' system in which encipherment is by pairs and yet one 
of the letters in each pair is enciphered monoalphabetically, the other 
letter', polyalphabetically, the method is only pseudo-polygraphic.. Cases 
qf this type are shown in Par. 31 of Special Text No. 166, Advanced Mili¬ 
tary 1 tlrypxography. Again, in a system in which encipherment is by pairs 
and'the encijpherraents of the left-hand and right-hand members of the pairs 
show’group’relationships, this is not pseudo-polygraphic but only partially 
polygVaphi’c. * Cases of this type are shown in Pars. 33-37, Special Text 
No. 166. . 

f. The fundamental purpose of polygraphic substitution is again the 
suppression of the frequency characteristics of plain text, just as is 
the ca se in monoalphabetic substitution with, variants; but here this is 
acc’SmpYis^ied by a’ different method, the latter arising from a somewhat 
different approach ,to the problem involved in producing cryptographic se¬ 
curity. When the substitution involves replacement of single letters in 
3- aflflgalpViafe'etic system, the cryptogram can be solved rather readily. 
Basically the reason for' this is that the principles of frequency and the 
laws^of probability, applied to individual units of the text (single 
letters^, have a very good opportunity to manifest themselves. A given 
volume^of text of sayji plain-text letters, enciphered purely monoalphabeti- 
ehlly,^ affords n‘ cipher characters, and the same number of cipher units. 

Jhf samp 'volume of text , enciphered digraphically, still affords n cipher 
characters but only n cipher units. Statistically speaking, the sample 

witlurf'whihh the laws of probability now apply has been cut in half. 
Furthermore, from the point of view of frequency, the very noticeable 
diversity in the frequencies of individual letters, leading to the marked 
cbpstb and' troughs of the monoliteral frequency distribution is no longer 
so strikingly in evidence in the frequencies of digraphs. Therefore, 
although true digraphic encipherment, for example, cuts the cryptographic 
textuaiuhits' in half, the difficulty of selution is not doubled, but, 
if a'matter of* judgment arising from practical experience can be expressed 
or’ approximated mathematically, squared or cubed. 

- •'V- 

g. . Sections VII and VIII of Special Text Nr. 166‘show various methods 
for the derivation of polygraphic equivalents and for handling these 
equivalents in cryptographing and decryptographing messages. The most 
practicable of those methods are digraphic in character and for this reason 
their solution will be treated in a somewhat more detailed manner than 
will trigraphic methods. The latter can be passed over with the simple 
statement, that their analysis requires much text to permit of solution by 
.the frequency method, and hard labor. Fortunately, they are infrequently 
encountered‘"because they are difficult to manipulate without extensive 
tables. 1 If the latter are required they must be compiled in the form of 

-*-A patent has been granted upon a rathor ingenious machine for automati¬ 
cally accomplishing true polygraphic substitution, but it has not been 
placed upon the market. See U. S. Patent No. 1,845,947 issued in 1932 to 
Weisner and Hill. In U. S. Patent No. 1,515,680 issued te> Henkels in 1924, 
there is.described a mechanism which also .produces polygraphic substitution. 
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a book or pamphlet. If one is willing to go that far, one might as well 
include in such documont more or less extensive lists of words and phrases, 
in which case the system falls under the category of code and not cipher. 

42. Tests for identifying digraphic substitution. - a. Tha tosts 
which are applied to determine whether a given cryptogram is digraphic in 
character are usually rather simple. If there are plenty of repetitions 
in the cryptogram and yet the monolitoral-frequency distribution gives no 
clear-cut indications of monoalphabeticity; if most of tho repetitions 
contain an even number of letters? and if the cryptogram contains an even 
number of letters, it may be assumed to bo digraphic in nature. 

b. The student should first try to determine whether the substitu¬ 
tion is completely digraphic, or only partially digraphic, or pscudo- 
digraphic in character, as aro the cryptograms produced by tfie methods 
indicated in Far. 31 f to i of Special Text No. 166, Advanced Military 
Cryptography. As mentioned above, there are cases in which, although the 
substitution is effected by taking pairs of letters, one of tho members 

of the pairs' is enciphered monoalphabotically, the other member, nolyalpha- 
botically. A distribution based upon the letters in tho odd positions and 
one based upon those in the. even positions should bo made. If one of these 
is clearly monoalphabetic, then this evidence*that the message represents 
a case pf pseudo-digraphisra of the type hero described. By attacking tho 
monoalphabetic portion of the messages, solution can soon be reached by 
slight variation of tho usual method, the polyalphabetic portion boing 
solved by the aid of the context and considerations based upon tho probable 
nature of the substitution chart (see Tables 2, 3, and 4 of Special Text 
No. 166). It will bo noted that tho charts referred to show definite 
symmetry in thoir construction. 

c. On the other hand, if the foregoing stops prove fruitless, it may 
bo assumed that the cryptogram is complotoly digraphic in character. 

d. Just as certain statistical tosts may bo applied to a cryptogram 
to establish its monoalphabeticity, so also may a statistical tost bo 
applied to a cryptogram for tho purpose of establishing its digraphicity. 
Tho nature of this tost and its method of application will bo discussed 
in a subsequent text. 

43. Gonoral procedure in tho analysis of digraphic substitution 
ciphers. - a. Tho analysis of cryptograms which have boon produced by 
digraphic substitution is accomplished largoly by tho application of tho 
simple principles of frequency of digraphs, with t,ho additional aid of 
such special circumstances as may bo known to or suspected by tho cryptana¬ 
lyst. Tho latter refer to peculiarities which may bo tho rosult of the 
particular method employed in obtaining tho equivalents of the plain-text 
digraphs in tho cryptographing procoss. In gonoral, however, only if 
there is sufficient text to disclose the normal phenomena of repetition 
will solution be feasible or possible. 
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]j. However, whon i digraphic system is omployod in rogular service, 
there is littlo doubt but that traffic will rapidly accumulate to an 
araouht more than sufficient to porait of solution by simple principles of 
frequency. Scmetimos only two or throo long messages, or a half dozen of 
avdrage length are sufficient. For v/ith the identification of only a few 
cipher digraphs, larger portions of messages may be read because the skele¬ 
tons of words formed from the few high-frequency digraphs very definitely 
limit the values that ban be inserted for the intervening unidentified 
digraphs. For example', suppose that the plain-text digraphs TH, ER, IN, 

IS, OF, NT, and TO have been identified by frequency considerations, cor¬ 
roborated by a tentatively identified long repetition; and suppose also 
that the enemy is knowh to be using a method which yields reciprocal equiva¬ 
lents between plain and cipher-text digraphs, as for instance the quadri- 
cular table shown in Ha'r. '31 a of Special Text No. 166. Suppose the message 


begins 

as follows (in which the 

assumed 

value3 

have 

been inserted): 

XQ 

VO 

ZI 

LK AI* 

OL 

ZX 

PV QN 

it OL 

UK 

AL HN LK VL 

FO 


TH 

IN 

NT 


RE 

NT 

NO 

IN 

' BN 

OZ 

KU 

DY EL 

Ljj] 

YW 

• • • • 

m 9 



SI 


ON 

TO 








The words ’FOURTH INFANTRY REGIMENT are readily recognized. The reciprocal 
pairs St and LE C suggest'ATTACK. The beginning of the message is now com¬ 
pletely" disclo'seds ' FOURTH INFANTRY REGIMENT NOT YET IN POSITION TO ATTACK. 
The values more or less automatically determined are VO = UR , AL p s TY , 

HN a ET, VL = PO . 0Z„ = TI , YW = CK . 
c p’ c p* c p 7 c p 

_g. . Once a good start has been made and a few words have been solved, 
subsequent work is quite simple and straightforward. A knowledge of enemy 
correspondence., including data regarding its most common words and phrases, 
is of great assistance in breaking down new digraphic tables of the same 
nature but with different equivalents 

jJ,. The remarks, made in above also apply to the details of solution 
in cases of partially digraphic substitution. 

■ 44. Analysis of digraphic substitution ciphers based upon 4-square 

.■pfreckprboard designs. - a. In Sectirn VIII of Special Text No. 166, 
•Ajlyp.nced Military Cryptography, there are shown various examples of di- 
graphic. substitution based upon the use of checkerboard designs. These 
ta&y be considered cases of .partially digraphic substitution in that in the 
checkerboard system there are certain relationships between plain-text 
digraphs' having common elements and their corresponding cipher-text di- 

f haphs, which will also have cowmen elements. For example, take the 
ollowing 4-square checkerboard designs 
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B 

W 

G 

R 

M 

0 

P 

A 

U 

L 

N 

Y 

V 

X 

E 

H 

Z 

Q 

D 

F 

S 

I 

C 

T 

K 

K 

I 

T 

S 

C 

U 

P 

L 

A 

0 

M 

w 

R 

B 

G 

D 

Z 

F 

Q 

H 

E 

Y 

X 

N 

V 

W 

A 

L 

_E 

S 

JL 

X_ 

K 

JL 

J.. 

_F 

H 

JJ 

I_ 

T_ 

JL 

M_ 

Y 

D 

X. 

P 

_X 

_B 

K_ 

C 

s 

A. 

JL 

w_ 

_L_ 

Ji¬ 

z 

R 

A. 

_G , 

G . 

_z_ 

,.Q. 

.N . 

Ji¬ 

ll 

M 

V 

Y 

0 

T 

H 

i 

F 

ll 


Fig. 20 

Here BG p = 0W C , B0 p * 0F o , BS p = 0P C , BG p - 0N C and BT p = 0D C . In each 
case when B p is the initial letter cf the plain-text pair, the initial 
letter of the cipher-text equivalent is 0 C . This, of course, is the direct 
result of the method; it means that the encipherment is monoalphabetic for 
the first half of' each of these five plain-text pairs, polyalphabetic for 
the second half. This relationship holds true for four other groups of 
pairs beginning with B p . In other words, there are five alphabets employed, 
not 25. Thus, this case differs from the case discussed under Par. 42 b 
only in that the monoalphabeticity is not complete for one half of all the 
pairs, but only among the members of certain groups of pairs. In a com¬ 
pletely digraphic system using a 676-cell randomized square, (for example, 
the cipher square illustrated in Par. 31 a of Special Text No. 166) such 
relationships are entirely absent and for this reason the system is cryp¬ 
tographically more secure than the checkerboard system. 

b. From the foregoing, it is clear that when solution has progressed 
sufficiently to disclose a few values, the insertion of letters within the 
cells o'f the checkerboard design to give the plain-text and cipher relation¬ 
ships indicated by the solved values immediately leads to the disclosure of 
additional Values. Thus, the solution of only a few values soon leads to 
the breakdown of the entire checkerboard design. 

c. (l) The following example will serve to illustrate the procedure. 
Let the message bo as followst 
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' 1 £ $ 4 5 

6 7 8 9 10 

1132 1314 15 

36 1738392) 

ZL 22 23 34 25 

35 2728 39 30 

_h> c a 'p 

ff 0 Q I L 

B S P K M 

N D U K E 

0 HQ N F 

B 0 R ti V 

B.‘^ C'L C H 

Q B'Q B X 

H ft AFX 

S I 0 K 0 

Q Y F N S' 

X M C G Y 

G- tXl.B.B,. 

JC A t D X 

LPMXH 

H R G K G 

Q K ‘g M L 

„ X BL9.S 1 

D. G 0 I H M 

U E 0 R 1) 

CLTUL 

SOQCS 

QNHFX. 

I F B E X 

E. ,: nBUl 5 

fcsq 0 

Ym A f t 

X S Y C B 

E p‘f N B__ 

S P K N U 

F. Q I T X E 

U Q M L F 

E Q 0 I G 

0 I E U E 

H P I A N 

Y T F L B 

G. F E E P I 

DHPCS 

N Q I H B_ 

F H M H F 

X C K-U P 

D G Q P N 

H. C B C Q L 

q P N F N 

P N I T 0 

R T E N C 

• B C N T_ 

F H H A Y 

I. % L QC I 

A A I Q U 

C H T P G 

B I F G W 

K F C Q S 

LQMCB 

J. IYCRQ 

Q D P R X 

F N q M L 

F I D G C 

0 Cf I 0 G_ 

0 I H H F 

K. ^ R ( C G 5 

G N D L N 

0 Z T F G 

E E R R P 

I F H 0 T_ 

F H H A Y 

L - UA. ° 

aaiq'u 

'C H T P 





(2) The cipher having been tested for standard alphabets 
"(by the method r f completing the normal components) and found t» 
give negative results, a raoneliteral-frequency distribution is 
made. It is as follows* 



ABC DE FGHIJKLMNCPQ RSTUVV7XYZ 
11 15 26 8 16 30 17 22 24 0 8 ' 14 11 18 15 16 33 9 6 11 11 0 1 12 7 3 


Fig. 21 
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(3) At first glance this may appear to the untrained ey# 
to be a nton'r alphabetic frequency distribution but upon cloa’er 
inspection it is noted that aside from the frequencies of four 
or five letters the frequencies_for the remaining letters are . 
not very dissimilar. There are, in reality, no very marked 
crests and troughs, certainly-not as.many a-s wculd be expected, 
in a monoalphabetic substitution cipher of equal length. 

(4) The message is now carefully examined for repetitions 
of 4 or more letters•*.. Hpri> are all of them: 


... —Frequency 

TFHHAYZLQCIAAIQUCHTP (20 letters) 2 

QMLFEQQIGOI (ll letters) . 2 

XIFBSX (6 letters) " ‘ 2 

FEQQ . . 3 

QMLF 3 

BEHM * - 2 

BSPK 2 

GOIH . . 2 


Located in Lines . 
H and K 
C and F 
C and D 
0, D, F 
C, F, J 
B and G 
A and E 
J3_and J 


.Since there are quite a few repetitions, two of considerable . 
length, since all but one of them contain an even number of 
letters, and since the message also contains an even number 
of letters, 344, digraphic substitution is suspected. 

The cryptogram is transcribed in 2-l.etter groups, for greater 
convenience in study. It is as follows: 

Message transcribed in pairs 



1 

2 

3 

4 

5 

6 

* 

8 

9 

10 

11 

12 

13 

14 

15 

A. 

HF 

GA 

FG 

OQ 

IL 

BS 

PK 

MN 

DU 

KE 

OH 

QN 

FB 

OR 

UN 

B. 

QC 

LC 

HQ 

BQ 

BF 

HM 

AF 

XS 

10 

KO 

QY 

FN 

SX 

MC 

GY 

C. 

XI 

FB 

EX 

AF 

DX 

LP 

MX 

HH 

RG 

KG 

QK 

iL 

LF 

EQ 

QI 

D. 

GO 

Jffl 

MU 

EO 

RD 

CL 

TU 

FE 


CG 

QN 

HF 

XI 

FB 

EX 

E. 

FL 

BU 

QF 

CH 

QO 

QM 

AF 

TX 

SY 

CB 

EP 

FN 

BS 

FK 

NU 

F. 

QI 

TX 

EU 

QM 

LF 

EQ 


GO 

_IL’ 

UE 

HP 

IA 

NY 

TF 

LE 

G. 

FE 

EP 

ID 

HP 

CG 

NQ 

IH 

BF 

HM 

HF 

XC 

KU 

PD 

GQ 

PN 

H. 

CB 

CQ 

LQ 

FN 

FN 

PN 

IT 

OR 

TE 

NC 

CB 

CN 

TF 

HH 

AY 

J. 

ZI 

JKL 

IA 

AI 

_2IL 

CH 

TP 

CB 

IF 

GW 

KF 

CQ 

RL 

QM 

CB 

K. 

OY 

CR 

QQ 

DP 

RX 

FN 

2M_ 

LF 

ID 

GC 

CG 

ie 

GO 

IH 

HF 

L. 

IR 

GG 

GG 

ND 

LN 

OZ 

,TF 

GE 

ER 

RP 

IF 

HO 

TF 

HH 

AY 

M. 

ZL 

_ac_ 

IA 

AI 

-fiLL 

CH 

TP 
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It is rioted‘that all the repetitions listed above break up properly 
into "(K^raphs except in one case* y_iz., _ FEQQ in lines C, D, and F. 

This sterns*"rather strange, arid at first thought one might suppose 
that a letter dropped out or was added in the vicinity of the FEQQ 
in line 5. But it is immediately seen that the FEQQ in line D has 
no relation at .all to the .F EQ Q. in lines C and F, and that the FEQQ 
in line j) is merely an accidental repetition. 

(5) A digraphic frequency distribution is made and is shown in. 
Fig. 22. 

i B C tf'fi F G IT I K L- M N 0 P Q R S T U V W X Y Z 


■■■■■■■■■■■■■■BHHDnunl 


IBMI 


IBBBBfl 


bbbbbbbbbbbbbdbbbbbbbdbbbi 

iBBBBBOBEIBBBDBBDBBBBflBBBBil 


BBflflBBBBBBBBflflBBBBBBBBBflfl 

BBBBBBBBBBBBBBB3BBBBBBBBS 

mamuMmummmmmmmmmmmmmmmmmm 

BBBBBBBBBBBBBBBBBBBBBBBBB 
BBBBBBBBBBBBBBBBBBBBBBBBB 
BBBBBBBBBBBBBBBBBBBBBBBBB 

bbbbbbbbbbbbbbbbbbbbbbbbH 
bbbbbbbbbbbbbbbbbbbbbbbbbI 

BBBBBBBEBSBBBBBBBHBBHBal 
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(6) The appearance of the digraphic distribution for 
this message .is quite, characteristic of that for a digraphic 
substitution cipher. Thera are many blank cells; although 
there are many cases in which a digraph appears only once, 
there are quite a few in which a digraph appears two or 
three times, four casos in which a digraph appears four 
times, and two cases in which a digraph appears five times. 
The absence of the letter J is also noted; this is often 
tho case in a digraphic system based upon a checkerboard 
design. 

(7) In another common type «f checkerboard system 
known as the Playfair cipher, described in Par. 46, one 
of the telltale indications besides the absence of the 
letter J is the absence of double letters, that is, two 
successive identical letter's. Tho occurrence of the double ’ 
letters GG, HH and Q'} in the message under investigation 
eliminates the possibility of its being a Playfair cipher. 
The simplest thing to assume is that a 4-squaro checker¬ 
board is involved. One with normal alphabets in Sections 

1 and 2 is therefore set down (Fig. 23 g.). 


: A 

B 

c 

D 

E 






F 

G 

H 

I-J 

K 






L 

' M 

N 

0 

P 






'■Q 

R 

S 

T 

U 






V 

T V 

X 

Y 

z 











A 

B 

0 

D 

JL 






F 

G 

H 

U 

K 






_L_ 

a 

N 

0 

p 





Hi 

A. 

R 

S 

T ! 

u 




j 

□ 

V 


_x_ 

E 

z 


Fig. 23 a. 

(8) The recurrence of tho group QMLF, three times, 
and at intervals suggesting that it might be a sentence 
separator, loads to the assumption that it is the word 
'STOP. The letters Q, M, L, and F are therefore inserted 
in the appropriate cells in Sections 3 and 2 of the dia¬ 
gram. Thus ’(Fig. 23 b)! 
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A 

B 

c 

D 

E 






F 

G 

K 

J-J 

IC 






L 

M 

K 

0 

p 





L 

Q 

R 

S 

T 

u 




Q 


V 

W 

X 

y 

z 












_B_ 

c 1 

_D_. 

E 






F_ 

_G_ 

JL 

I^J 

K 




F 



JL 

Jj 

0 

_P_ 



M 



_Q_ 

R 

s 

_T_ 

JL 






V 

W 

X 

y 

z 


Fig. 25 b 

# 

These placements seem logical. Moreover, in Section 3 the 
number of colls between L and Q i3 just one less _fchan enough to 
contain all the letters M to P, inclusive, and suggests that 
either N nr 0 is in the keyword portion of the sequence, that 
is, near the top of Section 5. Without making a commitment in 
the matter, suppose bpth N and 0, for the present, be inserted 
in the cell between M and P. Thus (Fig. 23 c.): 


A 

B 

C 


E 






F 

G 

H 

iJ 

K 



_ 



,L_ 

JL 

_N_ 

0 

P 





L 

JL 

R 

s 

JL 

JJ_ 

JL 

I 

p i 

_s_ 


_y_ 

ff 

X 

Y 

Z 



lI 
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(9) Now, if tho placement of P in Section 3 is 
correct, the cipher equivalent of TH p will be P0 C , and 
there should be a group of adequate frequency to cor¬ 
respond. Noting that PN C occurs throe times, it is as¬ 
sumed to be THp, and the letter N is inserted in the 
appropriate cell in Section 4. Thus (Fig. 23 d): 


A 

B 

G 

D 

E 






F 

G 

H 

I-J 

K 






L 

M 

N 

0 

P 





L 

Q 

R 

S 

T 

U 

M 

N^ 

n 

P 

Q 


V 

W 

X 

Y 

Z 











_A_ 

B 

c 

D 

_E_ 




JL 


_F_ 

_G_ 

H 

I-J 

_K_ 




F 


JL 

JL 

JL 

_0_ 

_P_ 



M 



A 

R 

_s_ 

JL 

_U 






_v_ 

W 

X 

Y 

z 


Fig. 23 d. 

(10) It is about time to try out these assumed 
values in the message. The proper insertions are made, 
with the following results: 



1 

2 

3 

4- 

5 

6 

7 

8 

9 

10 

11 

12 

13 

14 

15 

A. 

HF 

CA 

PG 

OQ 

IL 

JS-JEfL 

MN 

DU 

KE 

OH 

QN 

FB 

OR 

UN 

B. 

QG 

LC 

HQ 

BQ 

BF HM 

AF 

XS 

10 

KO 

Q2 

FN 

SX 

MC 

GY 

C. 

XI FB EX 

AF 

DX 

LP 

MX 

HH 

RG 

KG 

QK 

QM 

LF 

EQ 

QI 

D. 

JUH 

MU 

EO 

ra 

CL 

TU 

_FE 

QQ 

CG 

QN 

ST 

HF 

OP 

XI_ 

FB 

_EX 

E. 

FL 

BU 

QF 

CH 

CO 

QM 

ST 

AF 

TX 

SY 

CB 

EP 

FN 

gg...-BL 

NU 

F. 

QI 

TX 

EU 

QM 

LF 

EQ 

01 

GO 

IE 

UE 

HP 

IA 

NY 

TF 

LB 

G. 

FE 

EP 

ID 

StoT 

HP CG 

NQ 

IH 

M- HM. 

HF 

XC 

ICU 

PD 

GO 

PN 

TH 

AY 

H. 

CB 

CQ 

LQ 

_.IA_ 

PN 

FN 

PN 

IT 

OR 

TE 

NC 

CB 

CN 

TF 

IIH 

J. 

ZL 

__QC 

TH 

AL. 

—02 

TH 

CH 

TP 

CB 

IF 

GW 

KF 

CQ 

SL 

QM 

CB 
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1 2 3 4 5 6 7 S' ‘ 9 to *11 12' 13 14 15 

K. OY CR QQ DP RX ' FN QM L F ~IE Gc'cG 10 GO IH HF 

ST OP 


L. 1R CG GG ND LN 02 TF GE ER RP IF HO TF HH AY 



(ll) So far ns impossible combinations are in 
evidence. Beginning with group H4 in the"message is seen 
the following sequences 


1 . • NFNPli’ 

T H , . T A 

j ;: | , ! 

Assume it to be THAT THE. Then ATp s PN , ant ^ ^ et ‘* :er 
'N is to be inserted in row 4 column 1. But this is in¬ 
consistent with previous assumptions', since N in Section 
4 has already been tentatively placed in row 2 column 4 
of Section 4. Other assumptions for FN C are made: that 
it is IS (THIS TH...); that it is EN p (THEN TH...)» But 
the same inconsistency-is apparent. In fact,the student 
:Will see that FN C must represent a digraph ending in F, 

G, H, I-J, or K, since N c is tentatively located cn the 
.-■"i&mrs line as these letters in Section 2. Now FN C occurs 

’4 times in the message. The digraph it represents must 
(•be one of the following* 

DF, DG, EH, DI, DJ, DK 
IF, IG, IH, II, IJ, IK 
'■ . JF, JG, JH, JI, JJ, JK 

* OF, OG, OH, 01, OJ, OK 

TK, 

YF, YG, YH, YI, YJ, YK 

Of these the only one likely to be repeated 4 times is 
OF, yielding T H 0 F T H which may be a part of 
PNFNPN 

. NORTHOFTHE. or .SOUTHOFTHE. 

, CQLQPNFNPNIT CQLQPNFNPNIT 

t In either case, the position of the F in Section 3 is 
excellent: F . . . L in row 3. There are 3 cells 
intervening bet’ween’F and L, into which G, H, I-J, and 
K may be inserted. It is not nearly so likely that G, 

H, and K are in the keyword as that I should be in it. 
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Let it be assumed that this is the case, and let the 
letters be placed in the appropriate cells in Section ' 
3. Thus (Fig. 23 e). 


A 

B 

c 

D 

E 






F 

G 

H 

U 

K 






L 

M 

N 

0 

P 

F 

G 

H. 

K 

L 


R 

S 

T 

U 

M 

8 

P 

Q 


V 

W 

X 

Y 

Z 











A 

B 

C 

D 

Jet 




JL 


F 

_G 

H 

I-J 

K 




JL 


L 

M 

N 

0 

_P_ 



M 

JL 


_L 

_R_ 

S. 

T_ 

_L 






V 

JL 

X 

Y 

_z_ 


Fig. 23 e. 

Let the resultant derived values be checked against the 
frequency distribution. If the position of H in Section 
3 is correct, then the digraph 0N p , normally of high fre¬ 
quency should be represented several times by HF C . Ref¬ 
erence to Fig. 22 shows a frequency of 4 times. And HM C 
2 occurrences, represents NS^. There is no need to go 
through all the possible corroborations. 

(12) Going back to the assumption that T H . . T H 

P N F N P N 

is part of the expression .NORTHOFTHE.er 
CQLQPNFNPNIT 

. S0UTH0FTH3. , it is seen at once from Fig. 
CQLQPNFNPNIT 

23 e that the latter is apparently correct and not the 
former, because LQ C equals OU n and not OR^. If 0Sp = CQ 0 
this means that the letter G of the digraph CQ c must be 
placed in row 1 column 3 or row 2 column 3 of Section 3. 
Now the digraph CB c occurs 5 times, CG c , 4 times, CH c , 3 
times, CQ c , 2 times. Let an attempt be made to deduce 
the exact position of C- in Section 3 and the positions 
of B, G, and H in Section 4. 'Since F is already placed 
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•in Section 4, assume G and H directly fellow it,.and, 
that B comes before it.' How much before? Suppose a, 
trial be made. Thus (Fig. 23 f): 




A 

B 

C. 

D 

E 



c 



F 

G 

■H 

I-J 

•K 



c 



L 

M 

N 

0 

>P 

F 

G 

H 

K. 

L 

X 

R 

S 

i 

0 

M 

nr 

JL 

P 

> 


V 

W 

X 

t 

z 











A _ 

A 

C 

X 

X 




JL 


x 

± 

Jl 


JL 

B 

_B_ 

B 

X 

_G_ 

_L_ 

M 

N 

X 

p 

H 


k 

_Q_ 


X 

R 

S 

-T 

u 






X 

W 

X 

Y 

z 


Fig. 23 f.' 
'.Li; * : 


By referring now to the frequency distribution ^ip 
after a very few minutes, of .experimentation it.bec.ome 
apparent that the following Is correct: ‘ | 


A 

B 

C 

D 

E 



C 



F 

G 

H 

I-J 

K 

, 





ft 

M 

N 

0 

F 

•F‘ 

G 

H 

K 

L ' 

Q 

R 

S 

T " 

IT 

M 

i 

'P 

a 


V 

.w 

X 

Y 

Z 











A 

B 

G 

D 

X 




N 


F 

G 

H ‘ 

H 

K 

B 



F 

G 

'L' 

M 

N 

0 

P 

H 


M 

_Q_ 


_Q_ 

R 

S 

T 

X 






V 

W 

X 

Y , 

z 


Fig. 23 g. ’ 


I 
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cipher equivalents are taken from the sections containing the normal 
alphabets. For example, referring to Fig. 23 h, suppose that sections 
3-4 be used as the source of the plain-text pairs, and sections 1-2 as 
the source of the cipher-text pairs. Then OIIp - DG C , EHp = AU C , etc. 

(2) To solve a message enciphered in that e manner, it is necessary 
morely to make a square in which ail four sections are normal alphabets, 
and then perform two steps.. Firsts the cipher text pairs are converted 
into their normal alphabet equivalents morely by "deciphering" the mes¬ 
sage with that square; the result of this operation yields two mond- 
alphabots, one composed ol' the odd letters, the other^ of the even 
letters. The second step is to solve those t7o monoalphabets. , 

(3) Where the same mixed alphabet is inserted in sections 3 and 4> 
the problem is still easier, since the letters resulting from the conver¬ 
sion intp normal-alphabet equivalents all belong to the same,' single- 

.mixed alphabet. 

45. Analysis of ciphers based upon other types of checkerboard 
designs.'- The solution'of cryptograms enciphered by other typos of 
checkerboard designs is accomplished .along lines very similar to those set 
forth in" the- foregoing example of the solution of r. message, prepared by- 
means ‘of a 4-square checkerboard design. There are, unfortunately, no 
means of,tests which can be applied to determine in the early stages of 
the analysis exactly what type of design i3 involved in the first case 
urfdor study. The author freely admits that the solution outlined in sub- 
paragraph "_c is quite artificial in that nothing is demonstrated in step 
(7) that obyiously leads to or warrants the assumption that a 4-square 
checkerboard is involved. This point was passed over with the quite bald 
statement that this was "the simplest thing to assume" - and then the so¬ 
lution proceeds exactly as though this mere hypothesis has been definitely 
established. For example, the very first results obtninod were based upon 
assuming that a certain 4-letter repetition represented the word STOP and 
immediately inserting certain letters in appropriate cells in a 4-square 
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reject that hypothesis, and formulate a sec-nd hypothesis. In actual 
practice he may have to reject a second, third, fourth, ... nth hypothesis. 
In the end he may strike the right one - or he may not. There is no 
guaranty of success in the matter. In the third place, one of the objects 
of this text is to show hew certain‘systems, if employed for military pur¬ 
poses, can'readily be broken down.' Assuming that a checkerboard system 
is in use, and that daily changes in keywords are made, it is possible 
that the .traffic of the first day'might give considerable difficulty in 
solution, if‘the type of checkerboard ‘ were hot known to the cryptanalyst. 
But the second or third day's traffic would be easy to solve, because by 
that time the cryptanalytic personnel would have analyzed the system and 
thus learned what type of checkerboard'the enemy is .using. 


46. Analysis of the Playfair cipher system. - a. An excellent example 
of a practical, partially digraphic system is the Playfair cipher.^- 
It was used for a number of years as a field cipher by the British Army, 
before and during the World War, and for a short time, .also during that 
war, by field units of the American Expeditionary Forces* 


b. Published solutions 2 for this cipher are quite similar basically 
and vary only -in-minor details. The earliest, that .by Lieut. Ivi^jiborgne, 
used straightforward prinoiples of frequency to establish the values of 
three or four of the most frequent digraphs. • Then, on the. assumption 
that in most cases in which a keyword appears on the first and second rows, 
the last five letters of the normal alphabet, VrflCYZ, will rarely bo dis¬ 
turbed in sequence and will occupy the-last row of the square, he "juggles" 
the letters > given by the yaluos-tentatively.established from frequency 
considerations, placing-them, in various positions in the square, together 
with WXYZ, to correspond to the plain-text cipher relationshipst tentatively 
established.' a" later solution by Lieut. Frank Moorman, as described in 
Hitt's Manual, assumes that in a Playfair cipher prepared by means of a 
square in rfiich the keyword occupies the first and second rows, if a di¬ 
graphic frequency 'distribution is'made, it will be fduhd 'fhffi't ‘the "lottors 
having the .greatest combining power aro very probably letters of' : thp key. 

The latest published-solution by Lieut. Commander Smith, is perhaps the 


This cipher was really invented by Sir Charles Wheatstone but receives 
its name from Lord Playfair, who apparently was its sponsor before the 
British Foreign Office. See Wemyss Reid, "Memoirs of Lyon Playfair", 
London, 1899. 


Mauborgne, Lieut. J. 0. An advanced problem in cryptography and its 
solution, Leavenworth, 1914. 

Hitt, Captain Parker. Manual for the aolutioji of military ciphers, 
Leavenworth, 1918. 

Smith, Lieut. Command.er W. , U. S.. it. In "Cryptography" by Andre 
,Langie, f translated by,J. C. H. Macbeth, Now York, 1922. 
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fflo'fet lucid and systematized of the throe. He sets forth in definite 
language certain considerations wnieh the other two writers certainly 
entertained but failed to indicate. 

c_. The following details have beon summarized from Commander Smith's 
solutions 


(1) The Playfair cipher may be recognized by virtue of 
tho fact that it always contains an oven number of letters, 

.land that when divided into groups of two letters each, no 
group contains a repetition of the same letter, as NN or 
.S3, Repetitions of digraphs, trigraphs, and polygraphs 
will bo evident in fairly long messages. 

(2) Using the square 1 shown in Fig. 25 a, there are 
two general cases to be considored, as regards tho results 
of encipherment : 


B 

IT 

N 

K 

R 

D 

E 

F 

G 

H 

I-J 

L 

M 

0 

Q 

U 

P 

7T 

c 

T 

s 

V 


X 

Z 


- Fig. 25 a. 

Case 1. Letters at opposite corners of a rectangle. The following 
relationships are founds 

TH p = YF C 
HT_ = FY 


Reciprocity is complete. 

Case 2. Two letters in the same line or column. The following 
relationships are founds 


a V =NK c 

NA p = ™C 

1 The Playfair square acccmpanying Commander Smith's salutisn is based 
upon the keyword BANKRUPTCY, "to be distributed between the first and 
fourth lines of the square." This is a simple departure from the original 
Playfair scheme in which the letters of the keyword are written from left 
to right and in consecutive lines from the t«p downward. 
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But NKp does not = AN C , nor does 
KN p =' NA C 

Reciprocity is only partial. 

( 3 ) The foregoing gives rise to the follcwing. 


(a) Regardless of the position of the letters 
in the square, if 

1.2 - 3.4, then 

2.1 = 4.3 

(b) If 1 and 2 form opposite corners of a rec¬ 
tangle, the following equations obtain- 

1.2 = .3.4- 
2.1 = 4.3 

3.4 = 1.'2 

4.3 » 2.1 

( 4 ) A letter considered as occupying a position in 
a row can be combined with but four other letters in the 
same rowj the same letter considered as occupying a position 
in a column can be combined with but four other letters in 
the same column. Thus, this letter can be combined with 
only 8 other letters all told, under Case 2, above. But 
the same letter considered as occupying a corner of a 
rectangle can be combined with 16 other letters, under 
Case 1, above. Commander Smith derives from these facts 
the conclusion that "it would appear that Case 1 is twice 
as probable as Case 2." He continues thus; 

"Now in the square, note that: 


GNp = FK C m v Z FL C 

0N p = MK C also ET p = FP C 

CN p = TK C EW p = FV C 

XN p = WK C EF p = FG C 

"From this it is seen that of Jthe 24 equations that 
can be formed when each letter of the square is 'employed 
either as the initial or final letter of the group, five 
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will indicate a repetition of a corresponding letter 
of plain text. 

"Hence, RULE II. - After it has been determined, 
in t'ho equation 1.2 = 3.4, that, say, SN t , as FA C , there 
is a probability of one in five that any other group 
beginning with F c indicates E0 p , and that any group 
ending in A c indicates 0N p . 

"After such combinations as ERp 0R p and EN p have 
been assumed or determined, the above rule may be of use 
in disco /ering additional digraphs and partial words. 

RULE III. - In the equation 1.2 » 3.4, 1 and 3 
can never be identical, nor can 2 and 4 ever be identical. 

Thus, ANp could not possibly be represented by AY r , nor 
could ERp be represented by KR C . This rule is useful in 
,elimination of certain possibilities when a specific 
• .message is being studied. 

1 . There is an error in this reasoning. Take, for example, the 24 equations 
having F as an initial letter: 


Case 


Case 



Case 


Case 




.1 

F5 C = DN p 

2 

FE 

w ED 

2 

FT ■ M 

1 

FX 

B 

GW 

2 

Fp * EH 

1 

FL 

« EM 

2 

fw - nt 

1 

FR 


HN 

■ 1 

FI = EM 

1 

FP 

» ET 

1 

FK = GN 

2 

FH 

« 

EG 

■1 

tfU r nr 

1 

FV 

= EW 

2 

FG 5 EF 

1 

FQ 

s 

HM 

1 

F5 , = DW 

2 

FN 

= NW 

1 

FO = GM 

1 

FY 

s 

HT 

■1 

FA - SN 

2 

FM 

= NF' 

' 1 

FC = GT 

1 

FZ 

? 

HW 

Here, 

f the initial 

letter 

F c 

represents the following 

initial 0, 

o s 5 



•I 

1 



D E 

N G H 






It is seen that F c 

represents Bp, 

*P* V H 1 

3 4 times 

each, and 

E n’ 

, f 

3 times, 


Consequently, supposing that it has been determined that FA C - EN p , the 
probability that F^ will represent E p is not 1 in 5 feut 8 in 24, or 1 in 
3; but supposing that it has been determined that Ft? = NT p , the proba¬ 
bility that F c will represent N p is 4 in 24 or 1 in 6. The difference 
in these probabilities is occasioned by the fact that the first instance, 
FA C - ENp corresponds to a Case 1 encipherment, the second instance, 

FW c - NTp, to a Case 2 encipherment. But there is no way of knowing 
initially, and without other data, whether one is dealing with a Case 1 
or Case 2 encipherment. Only as an approximation, therefore, may one say 
that the probability of F c representing a given 6 p is 1 in 5. 
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RULE IV. - In the equation 1.2 p = 3.4 C , if 2 and 3 
are identical, the letters are all in the same row or 
column, and in the relative order 124. In the square 
shown, AKp = NK C and the absolute order is ANK. The 
relative order 124 includes five absolute orders .which 
are cyclic permutations of one another. Thus; ANK—, 

NK—A, K—AN, —ANK, and -ANK-. 


RULE V. - In the equation 1.2 = 3.4 C , if 1 

and 4 are identical, the letters are all in the same row 
or column, and in the relative order 243. In the square 
shown, KNp - RK C and the absolute order is NKR. The rela¬ 
tive order 243 includes five absolute orders which are 
cyclic permutations of one another. Thus NKR—, KR—N, 

R—NK, —NKR, and -NKR— 


RULE VI. - "Analyze the message for group re¬ 
currences. Select the groups of greatest recurrence and 
assume them to be high-frequency digraphs. Substitute 
the assumed digraphs throughout the message, testing the 
assumptions in their relation to other groups of the cipher. 

The reconstruction of 'the square proceeds simultaneously 
with the solution of the message and aids in hastening 
the translation of the cipher." 

(l) When solutions for the Playfair cipher system were 
first developed, based upon tho fact that the letters were inserted in. 
the calls in keyword-mixed order, cryptographers thought it desirable 
to place stumbling blocks in the path of such solution by departing from 
strict, keyword-mixed order. Playfair squares of tho latter typo are 
designed as "modified Playfair squares". One of the simplest methods is 
illustrated in Fig. 25, wherein it will bo noted that the last five 
letters of the keyword proper are inserted in tho fourth row of the square 
instead of the second, where they would naturally fall. Another method 
is to insert tho letters within the cells from loft to right and top down¬ 
ward but use a sequence that is a keyword-mixed sequence developed by a 
columnar transposition based upon the keyword proper. Thus, using the 
keyword BANKRUPTCY: 

215479683 10 
. .BANKRUPTCY 
DEFGHILMOQ 
5 V W X Z 


Sequence: ASVBDSCOKGXNF’VPLRHZTNEUIYQ 
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Playfair Square 


A 

E 

V 

B 

D 

S 

C 

0 

K 

G 

X 

N 

F 

. W 

P 

IT 

R 

H 

Z 

T 

M 

U 

I 

Y 

til 


Pig. 25 b 

(2) In the foregoing square practically all indi¬ 
cations that the square has been developed from a key¬ 
word have disappeared. The principal disadvantage of 
such an arrangement is that it requires more time to 
'locate the letters desired,-both in cryptographing and 
decryptographing, than it usually does when a semblance 
'of normal alphabetic order is preserved in the square. 


(3) Note the following three squares: 



t jyt first glance they all appear to be different, but 

closer examination shows them to be cyclic permutations 
of one another and of the square in Fig. 25 b. They 
-yield identical equivalents in all cases. However, if 
an attempt be made to reconstruct the original keyword, 
it would be much easier to d* so from Fig. 25 b than 
from any of the others, because in Fig. 25 b the keyword- 
mixed sequence has not been disturbed as much as in 
Figs. 25 c, d, e. In working with Playfair ciphers, the 
student should be on the lookout for such instances of 
Cyclic permutation of the original Playfair square, for 
during the course of solution he will not know whether 
he is building up the original or an equivalent, cyclic 
permutation of the original square; only after he has 
completely reconstructed the square will he be able to 
determine this point. 

e. (l) The steps in the solution of a typical example 

of this cipher may be useful. Let the message be as follows: 
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12345 6 7 8 9 30 3112131415 36 1718 19 20 a 22 23 2125 25 27 28 29 3D 

A. VTQEU HIOFT OH X S 6 AKTVT R A Z E V TAGAE 

B. OXTYM HGRLZ Z T Q T D. UM'GYC XCTGM TYCZU 

C. SNOPD GXV X S C A K T V. PU T 2 P T W ZFNBG 

D. PTRKX IXBPR ZOEPU TOLZE KTTCS NHCQM 

E. VTRKM If C F 20 BHTVY A B G I P RZKPC QFNLV 

F. OXOTU Z FACX XGPZX H GYNO TYOLG XXIIH 

G. T M S M X CPTOT GX OTT-GYATE XH FAC XXCPZ 

H. XJL y CT XWLZT 8GPZT VYY/GE TOG C C MBHMQ 

J. Y X Z P W G R T I V." . U X P U M jJR KM Y\T G X T M R S W G H B 

K. X G P T 0 t'g X' 0* T' iM I P Y D % b G K ( 1* ‘T'C'O.L X U E T P X 

. : ‘ ' '*■ '“} _ ' , r ' ” 

l. xfsrs uztdb; . hozig 4 jc'rki’x ‘zppvz iduhq 

M. OTKTK CCH X, X 

(2) Without going through the preliminary tests in de¬ 
tail, with which it will be assumed that the student is now familiar,1 
the conclusion is reached that the cryptogram is digraphic in nature, 
and an ordinary, simple digraphic frequency distribution is made. 

1 See Par. 48 c. *' S4 »* 1 
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Since there are no double-letter groups, the conclusion is reached that 
a Playfair cipher is involved and the message is rewritten in digraphs. 



1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

13 

14 

15 

A. 

VT 

QE 

UK 

10 

FT 

CH 

XS 

CA 

KT 

VT 

RA 

ZE 

VT 

AG 

AE 

B. 

OX 

TY 

UH 

CR 

LZ 

ZT 

QT 

DU 

MC 

YC 

XC 

TG 

MT 

YC 

ZU 

C. 

SN 

OP 

DG 

XV 

XS 

CA 

KT 

VT 

PK 

PU 

TZ 

FT 

WZ 

FN 

BG 

D. 

FT 

RK 

XI 

XB 

PR 

ZO 

EP 

UT 

OL 

ZE 

KT 

TC 

SN 

HC 

QM 

E. 

VT 

RK 

MW 

CF 

ZU 

BH 

TV 

YA 

BG 

IP 

RZ 

KP 

CQ 

FN 

LV 

F. 

OX 

OT 

uz 

FA 

CX 

XC 

-22- 

XH 

CY 

NO 

TY 

OL 

GX 

XI 

IH 

G. 

TM 

SM 

XC 

PT 

OT 

CX 

OT 

TC 

YA 

TE 

XH 

FA 

CX 

XC 

PZ 

H. 

Si 

YC 

TX 

WL 

ZT 

SG 

PZ 

TV 

_ YW 

CE 

TW 

GC 

CM 

BH 

MQ 

J. 

YX 

ZP 

m 

RT 

IV 

UX 

FU 

MQ 

RK 

MW 

CX 

TM 

RS 

WG 

HB 

K. 

xc. 

FT 

OT 

CX 

OT 

MI 

PY 

DN 

FG 

KI 

TC 

OL 

XU 

ET 

PX 

L. 

XF 

SR 

SII 

ZT 

DB 

HO 

ZI 

GX 

RK 

IX 

ZP 

FV 

ZI 

DU 

HQ 

M. 

OT 

KT 

KC 

OH 

XX 












(3) The following three fairly lengthy repetitions 

are noted: 

Lines 


F: 

OT 

UZ 

FA 

CX 

XC 

PZ 

XH 

CY NO 

G: 

TE 

XH 

FA 

—> 
CX 

XC 

PZ 

XH 

YC TX 









-or 

A: 

FT 

CH 

XS 

CA 

KT 

VT 

RA 

ZE 

C: 

DG 

XV 

XS 

CA 

KT 

VT 

PK 

PU 

G: 

TM 

SM 

XC 

FT 

OT 

CX 

OT 

TC 

K: 

WG 

HB 

XC 

FT 

CT 

cx 

OT 

MI 


The first long repetition, with the sequent reversed digraphs CX and XC 
immediately suggests the word BATTALION, split up into -B AT TA LI ON 
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and the sequence containing this repetition in lines F and G becomes 
as followst 


Line 

F: 

OX 

OT 

UZ 

FA 

CX 

XC 

PZ 

XH 

CY 

NO 

TE 






_ B 

AT 

TA 

LI 

ON 




Line 

G* 

YA 

TE 

XH 

fT 

CX 

XC 

PZ 

"XH 

YC 

TX 

WL 





ON 

B 

AT 

TA 

LI 

ON 





(4) Because of the frequent use of numerals beforo the 
word BATTALION and because of the appearance of ON before this word in 
line G, the possibility suggests itself that the word before BATTALION 
in line G is either ONE or SECO'ND. The identical digraph FA p.n both 
cases gives a hint that the word BATTALION in line F may also be pre¬ 
ceded by a numeral; if ONE is correct in line G, then THREE is possible 
in line F. On the other hand," "if SECOND is correct in line G, then 
THlRD’is possible in line F. Thus: 


Line Fj 

OX 

OT 

UZ 

FA 

"CX 

XC 

PZ 

XH 

CY 

NO 

TE 

1 st hypothesis 

— 

TH 

RE 

EB 

AT 

TA 

LI 

ON 




2 nd hypothesis 

— 

TH 

IR 

DB 








Line G* 

YA 

TE 

XH 

W 

CX 

XC 

PZ 

"XH 

YC 

TX 

WL. 

1 st hypothesis 


— 

ON 

EB 

AT 

TA 

LI 

ON 




2 nd hypothesis 

-S 

EC 

ON 

DB 









First, note that if either hypothesis is true,- then 0T C = TH p . The 
frequency distribution shows that OT occurs 6 times and is in fact the 
most frequent digraph in the message. Moreover, by Rule I of subpara¬ 
graph b, if CT c - THp then TO c - HT . Since HT p is a very rare digraph 
in normal plain text, T0 C should either not occur at "all in so short a 
message or else it should be very infrequent. The frequency distribu¬ 
tion shows its entire absence. Hence, there is nothing inconsistent 
with the possibility that the word in front of BATTALION in line F is 
THREE or THIRD, and some evidence that it is actually one or the other. 

(5) But can evidence be found for the support of one 
hypothesis against the other? Let the frequency distribution be examined 
with a view to throwing light upon this point. If the first hypothesis 
is true, then UZ = RIL, and, by Rule I, 2tf c = ER p . The frequency dis¬ 
tribution shows but one occurrence of UZ C and but two occurrences of ZU C . 
Theso do not look very good for-RE and ER. On the othor hand, if the 
2nd hypothesis is true, then UZ C = IR p and, by Rule I, ZU Q = RI . The 
frequencies are much more favorable in this case. Is there anything 
inconsistent with tho assumption, on the basis of the 2nd hypothesis, 
that TE C = EC p ? The frequency distribution shows no inconsistency, f*»r 
TE C occurs-once and ET C (= CE p , by Rule l) occurs once. As regards 
whether FA C r EB p or DB , both hypotheses are tenablo; possibly the 
2 nd hypothesis is a shade better than the 1st, on the following reasoning. 



REF ID:A64644 


- 112 - 


By Rule I, if FA c _ EBp then AF C - BEp, nr if FA C - DB p then AF C . BDp. 
The fact that no AF C occurs, whereas at least one BEp may be expected 
in this message, inclines one to the 2nd hypothesis, since BDp is very 
rare. .... 


(6) Let the 2nd hypothesis be assumed to be correct. 

The additional values are‘tentatively inserted in the text, and in lines 
G and K two interesting repetitions are noted* 


Line 

G* 

TM 

SM 

XC 

PT 

OT 

CX 

OT 

TC 

YA 

TE 

XH 

FA 

CX 

XC 

PZ 

XH 





TA 


TH 

AT 

TH 


-S 

EC 

ON 

DB 

AT 

TA 

LI 

ON 

Line 

K*' 

WG 

HB 

X~ 

FT 

OT 

CX 


MI 

PY 

DN 

FG 

KI 

TC 

OL 

XU 

ET 


TA TH AT TH 


This certainly looks like STATE THAT THE ..., which would make TE p = PT C . 
Furthermore, in line G the sequence STATETHATTHE..SECONDBATTALION’ can 
hardly be anything else than STATE THAT THEIR SECOND BATTALION, which 
would make TC c = EIp a“rid YX” W RSp. 'Also SM C = -Sp. 

(7) It is perhaps high time that the whole list of tenta¬ 
tive equivalent values be studied in relation to their consistency with 
the positions of-letters'in the Playfair square; moreover, by so doing, 
additional values may be obtained in the process. The complete list of 
values is as fellows; 


Assumed values 


Derived by Rule I 


AT p = CX c 

LI p = PZ c 
0N p - XH C 
TH p = 0T C ' 
IR p - uz c 
DB p - FA C 
ECp = TE C 
TE p = PT C 
EI p = TC c 
RS p = YA C 
-S p - SM C 


TAp - X0 C 

^ = ZP c 
N0 p = HX C 
HT p = T0 C 
Rip - ZU C 
BDp * AF C 
CE p = ET C 
ET p = TP C 
IE p = CT c 
SRp = AY C 

S-p = MS C 
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(8) By Rule V, the equation TH p - 0T C means that H, T, 
and 0 are all in the same row or column and in the relative order 243; 
similarly, C, 3, and T are in the same row or column and in the relative 
order 243.„ Further E, P, and T are in the same row, and column; and their 


relative* order is 

also 

243. That is, 

those 

sequences must occur in the 

square: 

(1) 


(2) 


(3) 


H T 

0 . . , 

or 

CET.. 

» or 

E T P . . , 

or 

T 0 

. . H , 

or 

E T . . C 

, or 

TP. . E , 

or 

0 . 

. H T , 

or 

T . . C E 

* or 

P . . E T , 

or 

’ ' ’ *■ h 

HTO, 

• r 

..CET 

, or 

' . •. E T P , 

or 

"T~V h 

T 0 


. ClfT . 


,;*e T P . 



(9) 

Noting the common 

letters E and T in the 

second and 

third sets 

of relative 

orders, these may be 

combined into one sequence 


of four letters. Only one position remains to be fillod and noting, in 
tho list of equivalents that EI p = TC c , it is obvious that the letter 
I belongs to tho CET sequence. The complete soquonce is therefore as 
follows* 

• ■ ■ C E T p I , or 

E T P I C , or 
IPIcr, or 
: P I G E T , or 

I C E T P 

(10) Taking up the HTO sequence, it is notod, in the 
list of equivalents that ONp - XH c , an equation containing two of tho 
throo lqtters of tho HTO sequence. From this it follows that N and X 
must belong to the same row or column as HTO. Tho ^arrangoment must be 
one of thq following: 

- . ' i , f . H T 0 X N 

T 0 X N H 
0 X N H T 
X N H T 0 
N H T 0 X 

' ■ ■ (11) Since tho sequence containing ,HTOXN has a common 

lottor (T) with tho sequonco CETPI, it follows that if the HTOXN soquonce 
occupies a row, then tho CETPI sequence must occupy a column; or, if 
the HTO sequence occupies a column, then the CETPI sequence must occupy 
a row; and they may be combined by means of their common lottor, T. 

Simple calculation will show that the two sequences may be combined in 
50 different ways, all of them yielding identical sets of equivalents. 
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(ere are a few of them: 


CD (2) (3) (4) (5) 



(11) (12) (13) (14) (15) 



(16) (17) (18) (19) (20) 



(12) Before trying to discover means whereby the actual 
rr absolute arrangement may be detected from among the full set of 50 
possible arrangements, the question may be raised: is it necessary? 
Since any one of the 50 arrangements will yield the same equivalents a: 
any of the remaining 49, perhaps a relative arrangement will do. 


(13) Let arrangement 13 be arbitrarily selected for trial 










REF ID:A64644 


- 115 - 


P 

_2_ 

~T ~ c_ 

_ E_ 

n|h|t 


Foix 


(14) What additional lettors can be inserted, using as 
a guide the list of equivalents in subparagraph (7)? There is AT p = CX c , 
for example. It contains only one letter, A, not in the arrangomont 
selected for trial, and this letter may immediately be placed, as shown: 




P 

□ 




I 





C 

_ 

_A 



JL 



N 

H 

T 

o’ 

z 


Scanning tho list for additional cases of this type, nono are found. But 
seeing that several high-frequency letters havo already boon inserted 
in tho square, perhaps reference to tho cryptogram itsolf in connection 
with values derivod from theso inserted letters may yield further clues. 
For example, the vowels A, E, I, and 0 are all in position, as aro tho 
very frequent consonants N and T. The following combinations may bo 
studiod: 


ANp - ?X C 

* 

it 

* 

NA p = X? 

TAp - XC c 

EN p = ?T C 

ET p = TP C 

NEp = T? 

TE p = PT C 

lN p = ?T C 

IT p * GP 0 

Nip = T? 

Tip = PC c 

0N p r XH C 

0T p = X0 C 

N0 p = HX C 

*-3 

II 

CX c ), TAp 

(» XC c ), 0N p (= XH C ) 

, TE p (» PT C ) 

and ETp (= TP C ) have 


already been inserted in tho text. Of tho others, only 0X C (= TOp) 
occurs two times, and this value can bo at once inserted in the text. 
But can the equivalents of AN, EM, or IN bo found from frequency con¬ 
siderations? Take EN p , for example; it is represented by ?T . What 
combination of ?T is most likely to represent EN^ among tho following 
candidates: 

KT C (4 times); by Rule I, NE p would s TK C (no occurrences) 

VT C (5 times); " " " NE p " = TV C (2 times) 

ZT C (3 times); " " H NE p " « TZ C (l time) 
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VT C cortainly looks good* it bogins tho message, suggesting the word 
ENEMY; in line H, in the sequence PZTV would become LINE. Lot this be 
assumed to be correct, ani let the word ENEMY also be assumed to be 
correct. Then EMp = QE C and the square then becomes as shown herewith* 


_IP_ 

_jf_ 

__C_A 

ImjLSL- 

N H T 0 X 


(15) In line E is seon the following sequences 

Lino Es VT RK MW OF ZU BH TV Ya BG IP RZ KP CQ FN LV 

EN RI NE RS PT E 

Tho sequence RI..NERS..PT suggests PRISONERS CAPTURED, as follows* 

MW CF ZU PH TV YA BG IP RZ KP 

P RI SO NERSC-n.PT UR ED 

This givos tho following new values* -P = CF ; SO = BH C ; CA = BG C ; 
UR p .= RZ c 5 ED p “ KP C . 

Tho letters B and G can bo placed in position at once, since tho posi¬ 
tions of C and a are already known. The insertion of tho letter B im¬ 
mediately permits the placement of tho letter S, from the equation 
SOp sr BH C . Of the remaining equations only ED p = KP„ can bo used. Since 
E and P are fixed, and are in the same column, D and K must bo in the 
same column, and moreover the K must be in the same row as E. There 
is only one possiblo position for K, viz., immediately after Q. This 
automatically fixes the position of D. The square is now as shown here¬ 
with* 




P 


D 



T 



G 

5 

c 

B 

A 

V 

M 

E 

s. 

K 

N 

H 

T 

0 

x’ 


(16) a review of all equations, including tho very first 
ones established, givos the following which may now be used* DBp = FA 0 ; 
RSp _ YA C . Tho first permits tho immediate placement of F; the second, 
by elimination of possible positions, permits the placement of both R 
and Y. The square is now as sho-wn herewith* 
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P 

IF 

D 


Y 

I 


R 

SL 

v_ 

S 

M_ 

c_ 

E 

B 

SL 

A 

K 

N 

H 

T 

0 

X 


Oncq more a review is made of all remaining thus far unused equations. 
Lip = PZ Q now permits the placement of L and Z. IR s UZ 0 now pormits 
the placement of U, which is confirmed by the oquation UR_ = RZ from 
the word CAPTURED. 


l J ,p f|d 

Z Y_IU R 

G'S ClT 

V M E QTir 

"nIh ItIoIx 


There is thon only one cell vacant, and it must bo occupied by the only 
letter left unplaced, viz., W. Thus the whole square has been recon¬ 
structed, and tho message can now be decryptographed. 

(17) Is tho square just reconstructed identical with 
the original, or is it a cyclic permutation of a keyword-mixed Playfair 
square of tho typo illustrated in Fig. 25b? 'Even though tho message 
can be read with ease, this point is still of interest. Let the sequence 
bo writtqn in five ways, each composed of five partial sequences made 
by cyclicly permuting each of the horizontal rows of the reconstructed 
square. Thus: 




Row 

1 


Row 

2 



Rew 

3 



Rew 

4 



Row 

5 


(a) 

L 

W P 

F D 

Z 

Y 

I 

U 

R 

G 

S C 

B 

A 

V 

M E 

Q 

K 

N 

H 

T 

0 

X 

(b) 

W 

P F 

D L 

Y 

I 

U 

R 

Z 

S 

C B 

A 

G 

M 

E Q 

K 

V 

H 

T 

0 

X 

N 

(c) 

P 

F D 

L W 

I 

U 

R 

2 

Y 

C 

B A 

G 

S 

E 

Q K 

V 

M 

T 

0 

X 

N 

H 

(d) 

F 

D L 

W P 

U 

R 

Z 

Y 

I 

B 

A G 

S 

C 

Q 

K V 

M 

E 

0 

X 

N 

H 

T 

Co) 

D 

L W 

P F 

R 

Z 

Y 

I 

U 


G S 

C 

B 

K 

V M 

E 

Q 

X 

N 

H 

T 

0 


By experimenting with those five sequences, in an endeavor to reconstruct 
a transposition rectangle conformable to a keyword sequence, tho last 
sequence yields tho following: 

P Y A C M N 
D F I G B E H 
LRUSKQT 
7/ Z V X 0 
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By shifting tho 0 from tho last position to the first, and rearranging 
the columns, the following is obtained* 

2 5 3 6 1 4 7 
COMPANY 
B D JB F G H I 
KLQRSTH 
V W X Z 

Tho original square must hp.ve been this: 


A 

G 

s 

c 

B 

k1 

17 

M 

E 

Q_ 

r 

N 

H 

T 

0 

D 

L 

¥ 

"P 

F 

R 

Z 

y 

I 

(J 


f. Continued practice in the solution of Playfair ciphers will 
make tho student quite export in the matter and will enable him to solve 
shorter and shorter messages. Also, with practice it will become a mat¬ 
ter of indifference to him as to whether the letters are insorted in the 
squ.aro with any sort of regularity, such as simple keyword -mixed ordor, 
columnar transposed keyword-mixed order, or in a purely random order. 

g. It may porhaps soom to tho student that tho foregoing steps 
are somewhat tco artificial, a bit too "cut and dried" in thoir accuracy 
to portray the process of analysis, as it is applied in actuality. For 
example, the critical student may woll object to some of tho assumptions 
and the reasoning in stop (5) above, in which the words THREE and ONE 
(1st hypothesis) were rojectod in favor of tho words THIRD and SECOND 
(2nd hypothesis). This rested largely upon the rejection of RE and ER p 
as the equivalents of U^ c and ZU c , ajid the adoption of IR p and RI p as 
their equivalents. Indeed, if the student will oxamino tho final mossage 
with a critical eye ho will find that while the bit of reasoning in step 
(5) is perfectly logical, tho assumption upon which it is based is in 
fact wrong, for it happens that in this case ER^ occurs only once and 
RE p does not occur at all. Consequently, although most of the reasoning 
which led to the rejection of the 1st‘hypothesis and the adoption of tho 
2nd was logical, it was in fact based upon erroneous assumption. In 
other words, despite the fact that tho assumption was incorrect, a correct 
deduction was made. The student should take note that in cryptanalysis 
situations of this sort ar.o not at all unusual . Indeed they are to be 
expected and a few words of explanation at this point may be useful. 

h. Cryptanalysis is a science in which a very large role is played 
by making deductions from observational data and the deductions usually 
rest upon assumptions. It is most often the case that tho cryptanalyst 
is forced to make his assumptions upon a quite limited amount of text. 
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It cannot be oxpectod that assumptions basod upon statistical generali¬ 
zations will always hold truo when appliod to data comparatively very 
much smaller in quantity than tho total data used to derive the genera¬ 
lized rules. Consequently, as regards assumptions mado.in specific 
mossdgos, most of tho time thoy will bo correct, but occasionally thoy 
will bo incorrect. In cryptanalysis it is often found that among the 
correct deductions there will bo casos in which subsequently discovered 
facts do not boar out the assumptions on which tho deduction was basod. 
Indood, it is sometimes truo that if tho facts had boon known before tho 
deduction was made, this knowledge would havo provontod making tho cor¬ 
rect deduction. For example, suppose tho cryptanalyst had somehow or 
other devinod that tho message under consideration contained no RE, only 
one 3R, one IR, and two RI's (as is actually tho case). Ho would certainly 
not havo boon able to choose betwoon the words THREE and ONE (1st hypothe¬ 
sis) as against THIRD and SECOND (2nd hypothosis). But bocause he assumes 
that there should bo more-ERp's and-REp's than IR's-and RI's in tho message, 
he deduces that UZ C cannot be RE-, rejects the 1st hypothesis and takes 
the 2nd. 'It later turns out, after tho problem has been solved, that the 
deduction'was'correct, although tho assumption on which it was based 
(expectation of more frequent appearance of RE p and ERp) was not in fact 
true in this particular case. The cryptanalyst can only hope that tho 
number of "times when his deductions aro correct, even though based upon 
assumptions which later turn out to bo erronoous, will abundantly exceed 
thp numbor of timos whon his deductions are wrong, oven though based upon 
assumptions which later provo t» be correct. If ho is lucky, the making 
of an assumption which is really not truo will make no differonco in tho 
end and will not delay solution; but if ho is specially favored with luck, 
it may actually holp him solvo the message—as vas tho case in this parti¬ 
cular oxample. 

i. another comment »f a general nature may bo mado in connection 
with this specific examplo. Tho student may ask what would have been tho 
procedure in this case if the message had not' contained such a tell-tale 
repetition as the word BATTALION, which formod the point of departure for 
the solution, or, as it is often said, permitted an "entoring wedge" to 
bo drivori into tho message. Tho answor to his query is that if tho word 
BATTALION had not boon repeatod, there would probably havo been some other 
repetition which would have permitted tho same sort of attack. If tho 
stud"6n£ is looking for cut and dried, straightforward, unvarying methods 
of attack, ho should remember that cryptanalysis, while it may be considered 
a branch of mathematics, is not a science which has many "general solu¬ 
tions" such as aro found and expected in mathematics proper. It is in¬ 
herent in the very nature of cryptanalytics that, as a rule , only goneral 
principlos can bo established; their practical application must tako ad¬ 
vantage of peculiarities and particular situations which are noted in 
•'’specific mossagos. This is especially truo in a text on tho subject. 

The illustration of a gonoral principle requires a specific oxample, and 
the latter must of nocossity manifest characteristics which make it differ¬ 
ent from any other oxample. Tho’ word BATTALION was riot purposely repeated 
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in this example in order to make the demonstration of solution easy: 

"it just happened that way". In another example, some other entering 
wedge would have been found. The student can be expected to learn only 
the general principles which will enable him to take advantage of the 
specific Characteristics manifested in specific cases . Here it is de¬ 
sired to illustrate the general principles of solving Playfair ciphers 
and to point out the fact that entering wedges must and can be found. 

The specific nature of the entering wedge varies with specific examples. 


SECTION X 

CONCLUDING REMARKS’ 


Paragraph 


Special remarks concerning the initial classification of 

cryptograms.•... 47 

Ciphers employing characters other than letters or figures ... 48 

Concluding remarks concerning monoalphabetic substitution. ... 49 

Analytical key for cryptanalysis . 50 


47. Special remarks concerning the initial classification of cryp¬ 
tograms. - a. The student should by this time have a good conception of 
the basic nature of monoalphabetic substitution and of the many "changes" 
which may be "rung" upon this simple "tune". The first step of all, 

naturally, is to be able to classify a cryptogram properly and place it 

in either the transposition or the substitution class. The tests for 
this classification have been given and as a rule he will encounter no 
difficulty in this respect. 

b. There are, however, certain kinds of cryptograms whose class 

cannot be determined in the usual manner, as outlined in Par. 13 of this 
text. First of all there is the type of code message which employs bona 
fide dictionary words as code groups.1 Naturally, a frequency distribu¬ 
tion of such a-message will approximate that for normal plain text. The 
appearance of the message, however, gives clear indications of what is 
involved. The study of such cases will be taken up in its proper place. 
At the moment it is only necessary to point out that these are code 

messages and not cipher , and it is for this reason that in Pars. 12 and 

13 the words "cipher" and "cipher messages" are used, the word "crypto¬ 
gram" being used only where technically correct. 

c. Secondly, there come the unusual and borderline cases, including 
cryptograms whose nature and type can not -be ascertained from frequency 
distributions. Here, the cryptograms are technically not ciphers but 
special forms of disguised secret writings which are rarely susceptible 
of b.eing classed as transposition or substitution. These include a large 
share of the cases wherein the cryptographic messages are disguised and 
carried under an external, innocuous text which is innocent and seemingly 


1 


See Par. 71, Special Text No. 165, Elementary Military Cryptography. 
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without cryptographic content - for instance, in a message wherein specific 
letters are indicated in a way not open to suspicion under censorship, 
these letters being intended to constitute the letters of the cryptographic 
message and the other letters constituting "dummies"* Obviously, no amount 
of frequency tabulations will avail a competent, expert - cryptanalyst in 
demonstrating or disclosing the presence of a cryptographic messago, 
writton and secreted within the "open" message, which serves but as an 
envelop and disguise for its authentic or real import. Certainly, su»h 
frequency tabulations can disclose the existence neither of substitution 
nor transposition in these c-asos, sinco both forms aro absent. Another 
very popular method that resembles the method mentioned above has for its 
basis a simplo grille. The whole words forming the secret text aro in¬ 
serted within perforations cut in the paper and the remaining space filled 
carefully, using "nulls" and "dummies", making a seemingly innocuous , 
ordinary message. There are other methods of this general type which can 
obviously neither be detected nor cryptanalyzed, using the principles of 
frequency of recurrences and repetition. These can not be further dis¬ 
cussed herein, but at a subsequent date a special text may be written for 
their handling. 

d. In view of the foregoing remarks, when so-called "symbol ciphers", 
that is, ciphers employing peculiar symbols, signs of punctuation, diacriti¬ 
cal marks, figures of "dancing men", and so on are encountered in practi¬ 
cal work nowadays, they are almost certain to be simple, monoalphabetic 
ciphers. They are adequately described in romantic tales, in popular 
books on cryptography, and in the more common types of magazine articles. 

No further space need be given ciphers of this type in this text, not only 
because of their simplicity but also because they are encountered in mili¬ 
tary cryptography only in sporadic instances in censorship activities. 

Sven in the latter cases, it is usually found that such ciphers are em¬ 
ployed in "intimate" correspondence for the exchange of sentiments that 
appear less decorous when set forth in plain language. They are very 
seldom or never used by authentic enemy agents. When such a cipher is 
encountered nowadays it may practically always be regarded as the work of 
the veriest tyro, when it is not that of a "crank" or a mentally deranged 
person. 


e. The usual preliminary procedure in handling such cases, where 
the symbols may be somewhat confusing to the mind because of their un¬ 
familiar appearance to the eye, is to substitute letters for them con¬ 
sistently throughout the message and then treat the resulting text as an 
ordinary cryptogram composed of letters is treated. This procedure also 
facilitates the construction of the necessary frequency distributions, 
yrhich would t be tedious to construct by using symbols. 

The subparagraph which the student has just read (47c) contains a hidden 
cryptographic messago. With the hints given in Par. 35e let the student 
Bee if he can find it. 

^ The most famouss Poe's "Gold Bug"; -Arthur Conan Doyle's "The Sign of 
Four". 
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f. A final word must be said on the subject of symbol ciphers by 
way of caution. When symbols are used to replace lottors, syllables and 
entire words, then the systoms approach code mothods in principle, and 
can become difficult of solution.^- The logical extension of the use of 
symbols in such a form of -writing is tho employment of arbitrary characters 
for a specially developed "shorthand" system bearing little or no rosom- 
blanco to woll-known, and thoroforo nonsocrot, systems of shorthand, such 
as "Grogg", "Pitman", otc. Unloss a considerable amount of toxt is avail¬ 
able for analysis, a privatoly-dovisod shorthand may bo very difficult if 
not impossible to solvo. Fortunately, such systoms are novor encountered 
in military cryptography. Thoy fall undor tho heading of cryptographic 
curiosities, of intorest to tho cryptanalyst in his loisuro moments.2 

48. Ciphers employing characters othor than lottors or figures. - a. 

In practical cryptography today, tho uso of characters othor than tho 26 
lottors of tho English alphabot is comparatively raro. It is true that 
there are a fow governments which still adhoro' to systoms yielding cryp¬ 
tograms in groups of figures. Theso aro almost in ovory case code systems 
and will bo treat id in their proper placo. In some cases cipher systoms, 
or systems of onciphoring code aro used which are basically mathematical 
in character and operation, and therefore uso nuinbors instoad of letters. 

Some persons aro inclined toward tho use of numbers rather than letters 
bocause numbers lend themselves much moro readily to certain arithmetical 
operations such as addition, subtraction, and so on, than do lottors. But 
there is usually added some final process whereby tho figure groups are 
converted into letter groups, for tho sake of oconomy in transmission. 

b. Tho only notable exceptions to tho statement contained in the 
first sentence of this paragraph aro those of Russian messages transmitted 
in the Russian Morse alphabet and Japanoso messages, transmitted in tho 
Kata Kana Morse alphabet. 

49. Concluding remarks concerning monoalphabotic substitution. - a. 

Tho alert student will have by this timo gathored that tho solution of 
monoalphabotic substitution ciphers of tho simplo or fixod typo aro particu¬ 
larly easy to solvo, onco the underlying principles aro thoroughly under¬ 
stood. As in othor arts, continued practice with examples loads to facility 
and skill in solution, ospocially where the student concentrates his atten¬ 
tion upon traffic all of tho same general naturo, so that the typo of text 

1 Tho uso of symbols for abbroviation and speed in writing goes back to 
the days of antiquity. Cicero is reportod to havo drawn up "a book like 
a dictionary, in which ho placed bofore each word the notation (symbol) 
which should represent it, and so groat was tho number of notations and 
words that whatovor could bo written in Latin could bo expressed in his 
notations." 

2 An example is found in tho famous Popys Diary, which was written in 
shorthand, purely for his own eyes by Samuel Popys (1633-1703). "Ho wrote 
it in Shelton's systom of tachygraphy (1641), which ho complicated by using 
foroign languages or by variotios of his own invention whonovor he had to 
record passagos least fit to bo seen by his sorvants, or by 'all tho world'." 
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which ho is continually oncountoring becomes familiar to him and its 
peculiarities or characteristics of construction givo duos for short 
cuts to solution. It is truo that a knowlodgo of tho gonoral phraseology 
of moSSagb^,' tho kind of words used, their sequences, and so on is of very 
great assistnneo in practical work in all fiolds of cryptanalysis. The 
student is urged to note particularly these finer dotails in tho course 
of his study. 

b. Another thing which the student should be on the lookout for in 
simple monoalphabetic substitution is the use, consecutively of several 
different mixed cipher alphabets in a single long message. Obviously, a 
single, composite frequency distribution for the whole message will not 
show the characteristic crest and trough appearance of a simple monoalpha¬ 
betic cipher, since a given cipher letter will represent different plain¬ 
text letters in different parts of the message. But if the cryptanalyst 
will carefully observe the distribution as it is being compiled, he will 
note that at first it presents the characteristic crest and trough appear¬ 
ance of monoalphabeticity, and that after a time it begins to lose this 
appearance. If possible he should be on the lookout for some peculiarity 
of grouping of letters which serves as an indicator for the shift from 
one cipher alphabet to the next. If he finds such an indicator he should 
begin a second distribution from that point on, and proceed until another 
shift or indicator is encountered. By thus isolating the different portions 
of the text, and rostricting the frequency distributions to the separate 
monoalphabets, the problem may be treated then as an ordinary simple mono- 
alphabetic substitution. 

c. Monoalphabetic substitution with variants represents an extension 
of the basic principle, with the intontion of masking the characteristic 
frequencies resulting from a strict monoalphabeticity, by moans of which 
solutions are rather readily obtained. Some of tho subterfugos applied 

in tho establishment of variant or multiplo values arc simple and more or 
less fail to servo tho purpose for which thoy are intended? othors, on 
the contrary, may interpose sorious difficulties to a straightforward solu¬ 
tion. But in no case may the problem be considered of more than ordinary 
difficulty”." Furthermore, it should bo recognized that whore these subter¬ 
fuges aro really adequate to tho purpose, tho complications introduced are 
such that tho practical manipulation of tho systom becomes as difficult 
for the cryptographer as for tho cryptanalyst. 

d. As already mentioned in monoalphabetic substitution with variants 
it is most common to employ figures or groups of figures. The reason for 
this is that tho use of numerical groups seems more natural or easier to 
tho uninitiated than does the use of varying combinations of lottors. 
Moreover, it is oasy to draw up cipher alphabets in which some of the let¬ 
ters are ropresontod by single digits, others by pairs of digits. Thus, 
the decomposition of tho cipher text which is an irregular intermixture of 
mono! it era.!'and polylitoral equivalents, is made more complicated and 
correspondingly difficult for tho cryptanalyst, who does not know which 
digits aro to bo used soparatoly, which in pairs. 
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e. A few words may bo added horo in regard to a method which often 

suggests itself to laymen. This consists in using a book possessed by 
all the correspondents and indicating the letters of the message by moans 
of numbers referring to specific lottors in tho book. One way consists 
in selecting a certain page and thon giving tho lino number and position 
of the letter in the lino, tho pago number being shown by a single initial 
indicator. Anothor way is to use tho entire book, giving tho cipher 
equivalents in groups of three numbors representing page, lino, and number 
of letter. (Ex.: 75-8-10 moans pago 75, 8th lino, 10th letter in the 

line.) Such systems aro, however, oxtremely cumbersome to use and, when 
the cryptographing is done carelessly, can bo solved. Tho basis for solu¬ 
tion in such cases rests upon the use of adjacent letters on the same line, 
the accidental repetitions of cortain lottors, and the occurrence of un- 
enciphored words in tho messages, when lazinoss or fatiguo intervenes in 
tho cryptographing. 

f. It may also be indicated that human nature and tho fallibility 
of ciphor clerks is such that it is rather rare for an encipherer to make 
full use of the complement of variants placed at his disposal. The result 
is that in most cases certain of tho equivalents will bo usod so much 
more ofton than others that diversities in frequencies will soon manifest 
thomsolves, affording important data'for attack by tho cryptanalyst. 

g. In the World War the cases whore monoalphabetic substitution 
ciphers were employed in actual operations on the Western Front were ex¬ 
ceedingly rare because the majority of the belligerents had a fair know¬ 
ledge of cryptography. On the Eastern Front, however, the extensive use, 
by the poorly prepared Russian Army, of monoalphabetic ciphers in the 
fall of 1914 was an important, if not the most important, factor in the 
success of the German operations during the Battle of Tannenberg. It 
seems that a somewhat more secure cipher system was authorized, but proved 
too difficult for the untrained Russian cryptographic and radio personnel. 
Consequently, recourse was had to simple substitution ciphers, somewhat 
interspersed with plain text, and sometimes to messages completely in 
plain language. The damage which this faulty use of cryptography did to 
the Russian Army and thus to the Allied Powers is incalculable. 

h. Many of the messages found by censors in letters sent by mail 
during the World War were cases of monoalphabetic substitution, disguised 
in various ways. 

Gyldon, Yvos. Chifforbvr&ernas Insatsor I VS.rldskriget Till Lands , 
Stockholm, 1931. Translation under the title The Contribution of tho 
Cryptographic Bureaus in the World War , appeared in the Signal Corps 
Bulletin in sevon successive installments, from November-Dee ember 1933 
to November-Docember 1934, inclusive. 

Nikolaieff, A. M. Secrot Causes of German success on the Eastern Front . 
Coast Artillery Journal, Soptember-Octobor, 1935. 
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50. Analytical key for cryptanalysis. - a. It may be of assistance 
to indicate, by means of an outline, the relationships existing among 
the various cryptographic systems thus far considered. This graphic out¬ 
line will be augmented from time to time as the different cipher systems 
are examined, and will constitute what has already been alluded to in 
Par.. 6 ^ and there termed an analytical key for cryptanalysis. 1 Funda¬ 
mentally its nature is that of a schematic classification of the different 
systems examined. 

b. Note, in the analytical key, the rather clear-cut, dichotomous 
method of treatment, that is, classification by subdivision into pairs. 

For example, in the very first step there are only two alternatives: the 
cryptogram is either (l) cipher, or (2) code. If it is cipher, it is 
either (l) substitution (2) transposition. If it is a substitution cipher, 
it is either (l) monographic, or (2) polygraphic, and so on. If the stu¬ 
dent will study the analytical key attentively, it will assist him in 
fixing in mind the manner in which the various systems covered thus far 
are related to one another, and this will be of benefit in clearing away 
some of the mental fog or haziness from which he is at first apt to suffer. 

This analytical key is quite analogous to the analytical keys usually 
found in the handbooks biolcgists commonly employ in the classification and 
identification of living organisms. In fact, there are several peints of 
resemblance between, for example, tl^at branch of biology called taxonomic 
botany and cryptanalysis. In the former the first steps in the classifi- 
catory process are based upon observation of externally quite marked differ¬ 
ences;^ the’ process continues, the observational details become finer and 
finer, involving more and more difficulties as the work progresses. Towards 
the end of the work the botanical taxonomist may have to dissect the speci¬ 
men and study internal characteristics. The wh«le process is largely a 
matter of painstaking, accurate observation of data and drawing proper con¬ 
clusions therefrom. Except for the’fact that the botanical taxonomist de¬ 
pends almost entirely upon ocular observation of characteristics while the 
cryptanalyst in addition to observation must use some statistics, the steps 
taken by the former are quite similar to those taken by the latter. It is 
only at the very end of the work that a significant dissimilarity between 
the two sciences arises. If the botanist makes a mistake in observation 
or deduction, he merely fails to identify the specimen correctly; he has an 
"answer" — but the answer is wrong. He may not be cognizant of the error; 
however, other more skillful botanists will find him out. But if the 
cryptanalyst makes a mistake in observation or deduction, ho fails to get 
any "answer" at all; he needs nobody to te'll him he has failed. Further, 
there is one additional important point of difference. The botanist is 
studying a bit of Nature -- and she does not consciously interpose obstacles, 
pitfalls, and' dissimulations in the path of those trying to solve her mys¬ 
teries. The cryptanalyst, on the other hand, is studying a piece of writing 
prepared with the express purpose of preventing its being read by any per¬ 
sons for whom it is not intended. Tho obstacles, pitfalls, and dissimula¬ 
tions are hero consciously interposed by the one who cryptographed tho 
message. These, of course, are what make cryptanalysis differont and diffi¬ 
cult . 
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c. Tho numbers in parentheses refer to specific paragraphs in this 
text, so that tho student may readily turn to tho text for detailed in¬ 
formation or for purposos of refreshing his memory as to procodure. 

4.. In addition to those reference numbers thoho have boon affixed 
to the successive stops in the dichotomy, numbers that mark the "routes" 
on the cryptanalytic map (the analytical koy) which tho student cryptana¬ 
lyst should follow if ho wishes to facilitate his travels along tho rather 
complicated and difficult road to succoss in cryptanalysis, in somewhat 
tho sumo way in which‘an intelligent motorist follows the routes indicated 
on a geographical map if ho wishes to facilitate his travels along unfamiliar 
roads. The analogy is only partially valid, howover. Tho motorist usually 
knows in advance tho distant point which ho dosiros to roach and he proceeds 
thereto by tho bost and shortost routo, which he finds by observing the 
route indications on u map and following tho routo markors on the road. 
Occasionally ho encountors a detour but thoso are unoxpected difficulties 
as a rule. Least of all does he anticipate any nocossity for journeys 
down what may soon turn out to bo blind alloys and "dead-end" stroots, 
forcing him to double back on his way. Nov the cryptanalyst also has a 
distant goal in mind — tho solution of tho cryptogram at hand — but he 
does not know at tho outset of his journey the exact spot whore it is located 
on tho cryptanalytic map. The map contains many routos and ho procoeds to 
tost them one by ono, in a successive' chain. He encounters many blind 
alleys and doad-ond stroots, which forco him to retrace his steps; he makes 
many detours and jumps many hurdles. Some of those rotracings of steps, 
doubling back on his tracks, jumping of hurdles and detours are unavoidable, 
but a few are avoidable. If properly employed, tho analytical koy will 
help the careful student to avoid thoso which should and can be avoided; 
if it does that much it will servo tho principal purpose for which it is 
intended. 


e. Tho analytical koy may, however, servo another purposo of a some¬ 
what different nature. Whon a multitude of cryptographic systems of di¬ 
verse typos must be filod in somo systematic manner apart from tho names 
of the correspondents or othor roforonco data, or if in conducting in¬ 
structional activities classificatory designations are desirable, tho ref¬ 
erence numbers on the analytical koy may bo made to serve as "typo numbers". 
Thus, instead of stating that a given cryptogram is a koyword-systemati- 
cally-mixod-monoliteral-monoalphabotic-monographic substitution cipher ono 
may say that it is a "Type 901 cryptogram". 

f. Tho method of assigning type numbers is quite simple. If tho stu¬ 
dent will examine tho numbers ho will note that successive levels in the 
dichotomy are designatod by successive hundreds. Thus, tho first lovel, 
tho classification into cipher and code is assigned the numbers 101 and 
102. On tho socond level, under cipher, the classification into monographic 
and polygraphic systems is assigned tho numbers 201 and 202, etc. Numbors 
in the same hundreds apply thorofore to systems at tho same level in the 
classification. There is no particular virtuo in this scheme of assigning 
type numbers except that it provides for a considerable degree of expansion 
in future studies. 
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TABLE 1-A 




Absolute frequencies of letters appearing in five 

sets of Govcrn- 


mental plain-text telegrams. 

c_ch set containing 10,000 

letters. 



Arranged alphabetically. 



Message 

Message 

Message 

Message 

Message 

♦ 

No. 1 

No. 2 

No. 3 

No. 4 

No. 5 


t» 

t>.. 

s» 

!*> 

* 


5 3 

0 o 

5 s 

£ S 


* 

ft M § 

© 1 1 

ft 2 $ 

In 2 © 

ft 5 S 



■*» o o 1 

45 o & 

® O & 

® O O’ 


i-S *< h 

1 II 

1 II 

3 l| 

S II ■ 


A - 738 

A - 783 

A - 681 

A - 740 

A - 741 


B - 104 

B - 103 

B - 98 

B - 33 

3 - S9 


C - J519 

0 - 300 

G - 238 

C - 526 

G - 501 


D - 387 

D - 413 

D - 423 

D - 451 

D - 448 


E -1367 

E -1294 

E -1292 

E -1270 

E -1275 


F - 253 

F - .287 

F - 308 

F - 287 

F - 281 


G - 166 

G - 175 

G - 161 

G - 167 

G - 150 


H - 310 

H - 351 

H - 335 

K - 349 

H - 349 


I - 742 

I - 750 

I - 787 

I - 700 

I - 'Wl 


J - 18 

J - 17 

J - 10 

J - 21 

J - 16 


K - 36 

E - 38 

K - 22 

K - 21 

X - 31 


L - 365 

L - 393 

L - 333 

L - 386 

L - 344 

' 

M - 242 

M - 240 

M - 238 

M - 249 

M - 268 


N - 786 

N - 794 

N - 815 

N - 800 

N - 780 


0 - 685 

0 - 770 

0 - 791 

0 - 756 

0 - 762 


P - 241 

P - 272 

P - 317 

P - 245 

P - 260 


Q - 40 

Q - 22 

Q - 45 

Q - 38 

Q - 30 


R - 760 

R - 745 

R - 762 

R - 735 

R - 796 ' 


S - 658 

S - 503 

S - 585 

S - 628 

S - 604 


T - 936 

T - 879 

T - 894 

T - 958 

T - 928 


U - 270 

U - 233 

U - 312 

U - 247 

U - 238 


V - 163 

V - 173 

V - 142 

V - 133 

V - 155 


W - 166 

W - 163 

T. - 136 

V - 133 

V. r - 182 


X - 43 

X - 50 

X - 44 

X - 53 

X - 41 


i -"191 

Y - 155 

Y - 179 

Y - 213 

Y - 229 


Z - 14 

Z - 17 

Z - 2 

Z - .,11 

Z - 5 


Tptalc 






10,000 

10,000 

10,000 

10,000 

10,000 




Table 2-A 




Absolute, frequencies of 

letters appearing in the combined five sets 


of messages totalling 50,000 

letters arr: nged alphabetically. 


A - 3683 

G - 819 

L - 1821 

Q - 175 

V - 766 


B - 487 

H - 1694 

M - 1237 

R - 5788 

W - 780 


C - 1534 

I - 3676 

N - 3975 

S - 3058 

X - 231 


D - 2122 

J - 82 

0 - 3764 

T - 4595 

Y - 967 


E - 6498 

K - 148 

P - 1335 

TJ - 1300 

Z - 49 H 


F - 1416 

" i 




1 
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TABLE 1-B 






1 Absolute frequencies of letters app 

earing in five 

sets of Gov- 


■ ernment pi?-in-text telegrams, eae v 

> set containing 10,000 letters. 



Arranged according to frequency. 




Message 

Message 

Message 

Message 

Message 


No.' 'i* 

No. 2 

No. 

, 3 

No. 

, 4 

No. 

: 5 

* 


>» 


>* 


!» 




CD o 
■P P 

s a 


5 S 


CD O 
p Pi 


2-8 


& II 

S 5 s, 

u 

<0 

3 3 

$5 ; 

3 % 

55 ; 

5 S 


j +> A ai 

4» g O 1 


8 g 1 

-p 

O O' 

■p 

a sr 


1 Ifi 

^ N 

<0 

p] 

3 & 

CD 

SI 

CD , 

p? ! 

| 2 


E -1367 

F, -1294 

E -1292 

E -1270 

E -1275 


T - 936 

T - 879 

T - 

894 

T - 

958 

T - 

928 


N - 78'p 

N - 794 

N - 

815 

N - 

800 

R - 

786 


R - 76® 

A - 783 

0 - 

791 

0 - 

756 

N - 

780 


I - 742 

0 - 770 

I - 

787 

A - 

740 

0 - 

762 


A - 738 

I - 750 

R - 

762 

R - 

735 

A - 

741 


0 - 685 

R - 745 

A - 

681 

I - 

700 

I - 

697 


S - 658 

S - 583 

S _ 

585 

S - 

628 

S - 

604 


D - 387 

D - 413 

r> - 

423 

D - 

451 

D - 

448 


L - 365 

L - 393 

H - 

335 

L - 

386 

H - 

349 


C - 319 

H - 351 

L - 

333 

H - 

349 

L - 

344 


H - 310 

C - 300 

P - 

317 

C - 

326 

C - 

301 


U - 270 

F - 287 

U - 

312 

F - 

287 

F - 

281 

' 

F - 253 

P - 272 

F - 

308 

M - 

249 

M - 

268 


M - 242 

M - '240 

C - 

288 

U - 

247 

P - 

260 


P - 241 

U - 233 

M - 

238 

P - 

245 

U - 

238 


Y - 191 

G - 175 

Y - 

179 

Y - 

213 

Y - 

229 


G - 166 

V - 173 

G - 

161 

G - 

167 

W - 

182 


W - 166 

VI - 163 

V - 

142 

V - 

135 

M - 

155 


V - 163 

Y - 155 

t; - 

136 

W - 

133 

G - 

150 


B - 104 

B - 103 

B - 

98 

B - 

83 

B - 

99 


X - 43 

X - 50 

Q - 

45 

X - 

53 

X - 

41 


Q - 40 

K - 38 

X - 

44 

Q - 

38 

K - 

31 


K - 36 

Q - 22 

K - 

22 

K - 

21 

Q - 

50 


J - 18 

J - 17 

J - 

10 

J - 

21 

J - 

16 


Z - 14 

Z - 17 

2 — 

2 

Z - 

11 

z - 

_5_ 


Totals 









10,000 

10,000 

10 

,000 

10 

,000 

10 

,000 




TABLE 1-C 






■ Absolute frequencies of vowels, high frequency consonants, 



■ medium frequency consonants, and 

low frequency consonants appearing 


B in five 

sets of Government plain- 

text telegrams 

, each 

set cont 

lining 


B 10,000 letters. 








Message 


High Freq. 

Medium Freq 


Low Freq 



No. 

Consonants. 

Consonr nts. 


Consonant 

s. 


1 

3993 

3527 


2329 


151 



2 

3985 

3414 


2457 


144 



3 

4042 

3479 


2356 


123 



4 

3926 

5572 


2356 


144 



5 

3942 

3546 


2389 


123 



Totals 

19,388 

17,538 

11,889 


685 

50,000 
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TABLE 2-E 

Absolute frequencies of letters, appearing in the combined five 
sets of messages totailing 50,000 letters arranged according to 
frequencies. 


E -§498 

. I -3676 

C -1534 

Y - 

967 

X 

- 231 

T -4C95- 

G -3058 

F -1416 

G - 

819 

Q 

- 175 

N -3975 

D -2122 

P -1335 

W - 

780 

K 

- 148 

R -3788 

L -1821 

U -1300" 

V - 

766 

J 

- 82 

0 -3764 

H -1694 

M -1237 

B - 

487 

Z 

- 49 


A -5683 

TABLE 2-C 

Absolute frequencies of vowels, high frequency consonants, 
medium_frequency consonants, anrt low frequency consonants appearing 
in t!he" combined five sots of messages totalling 50,000 letters. 

Vowels . . .. 19,888 

High "frequency Consonants (D, N, R, S, and T). 17,538 

Medium Frequency Consonants (B , C, F, G, II, L, M, P, V and W) 11,889 

Low Frequency Consonants (J, K, Q, X and Z). 685 

Total" . 50,000 


TABLE 2-D 

Absolute frequencies'of letters as initial letters of 10,000 
words found in Government plain-text telegrams. 

(l) Arranged alph&botecally 


A - 905 

G - 109 

L - 196 

Q - 30 

V - 77 

B - 287 

H - 272 

M - 384 

R - 611 

7/ - 320 

C - 664 

I - 34-1 

N - 441 

S - 965 

X - 4 

D - 525 

J - 44 

0 - 646 

T -1253 

Y - 88 

E - 390 

F - 855 

K - 23 

P - 453 

U - 122 

Z - 12 




Total 

10,000 


(2) Arranged according to absolute frequencies. 

T -1253 

R - 611 

M - 384 

L - 196 

J - 44 

S~'965 

D - 525 

I - 344 

U - 122 

Q. - 30 

A - 9C5 

N - 441 

W - 320 

G - 109 

K - 23 

F - 855 

P - 433 

B 287 

1-38 

Z - 12 

C -..664 

0 - 646 

E - 390 

H - 272 

V - 77 

X - 4 


Total 


10,000 
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TABLE 2-E 

Absolute frequencies of letters as fiml letters of 10,000 
words found in Government plrin-text telegrams. 


(1) Arranged alphabetically. 


A - 269 

B - 22 

C - 86 

D -1002 

E -1628 

F - 252 ... 

0 - 225 

II - 450 
1-22 

J - 6 

K - 53 

L - 354 
to - 154 

N - 872 

0 - 575 

P - 213 

Q - 8 

R - 769 

S - 962 

T -1007 

U - 51 

V - 4 

W - 45 

X - 113 

Y - 866 

Z - 9 





Total 

10,000 

(2) 

Arranged according to absolute froqurncios. 

E -1628 

P - 769 

F ■ 

- 252 

C - 86 

1-22 

T -1007 

0 - 575 

G 

- 225 

K - 53 

Z - 9 

D -1002 

H - 450 

P 

- 213 

W - 45 

Q - 3 

S - 962 

L - 354 

M 

- lc>4 

U - 31 

j - 6 

N - 872 

A - 269 

X 

- 116 

_ 3 - 22 

V - 4 

Y - 866 










Tot-.l 

10,000 




TABLE 3 



Relative frequencies of 

1 letters 

appearing in 1 

,0C0 letter; 

based upon table 2. 






(1) 

Arranged alphabetic.'lly 


A - 73.66 

G - 16.38 

L 

- 36.42 

Q - 5.30 

7 - 15.32 

B - 9.74 

H - 33.88 

H 

- 24.74 

R - 75.76 

W - 15.60 

C - 30.68 

I - 73.52 

H 

- 79.50 

S - 61.lt 

X - 4.62 

D - 42.44 

J - 1.64 

0 

- 75.28 

T - 31.9C 

Y - 19.34 

E -129.96 

K - 2.96 

P 

- 26.70 

U - 26.Of. 

Z - .98 

F - 28.32 










Total 

1000.00 


(2) Arranged 

according to fr ;qu^r.cy. 


E -129.96 

I - 73.52 

0 

- 30.68 

I - 19.3- 

X - 4.62 

T - 91.90 

S - 61.16 

F 

- 28.32 

G - 16.38 

Q - 3.50 

N - 79.50 

D - 42.44 

P 

- 26.70 

W - 15.60 

K - 2.96 

R - 75.76 

L - 36.42 

U 

- 26.00 

V - 15.32 

J - 1.64 

0 - 75.28 

H - 33.88 

M 

- 24.74 

B - 9.74 

Z - .98 

A - 73,66 







Total 1000.00 
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(3) (4) (5) (6) 



JTowels 

High Freq. 
Consonants 


Medium Frequency 
Consonants 

Low Freq. 
Consonants 

t 

- 73.G6 

D - 42.44 

B 

- 9.74 

L - 36.42 

X - 4.62 

t 

-129.96 

N - 79.50 

C 

- 30.68 

M - 24.74 

Q - 3.50 

X 

- 73.52 

R - 75.76 

F 

- 28.52 

P - 26.70 

K - 2.96 

q 

- 75.28 

S - 61.16 

G 

- 16.38 

V - 15.32 

J - 1.64 

l 

- 26.00 
- 19.34 

T - 91.90 

H 

- 33.88 

W - 15.60 

Z - .98 

Petals 

397.76 

350.76 



237.78 

15.70 


Total — 1000.00 


TABLE 4 

Frequency Distribution for 10,000 letters of literary English, 
as compiled by Hitt.l 


A. Alphabetically arranged . 


A - 778 

G 

- 174 

L - 372 

Q - 8 

V 

- 

112 

B - 141 

H 

- 595 

M - 288 

R - 651 

W 

- 

176 

C - 296 

T 

- 667 

R. - 686 

S - 622 

X 

- 

27 

D - 402 

J 

- 51 

0 - 807 

T - 855 

Y 

- 

196 

E -1277 

F - 197 

K 

- 74 

P - 223 

U - 308 

Z 

“ 

17 


B. 

Arranged 

according to 

frequency. 




E -1277 

R 

- 651 

U - 308 

Y - 196 

K 

_ 

74 

T - 855 

S 

- 622 

C - 296 

W - 176 

J 

- 

51 

0 - 807 

H 

- 595 

M - 288 

G - 174 

X 

- 

27 

A - 778 

D 

- 402 

P - 223 

B - 141 

Z 

- 

17 

» - 686 

I - 667 

L 

- 372 

F - 197 

V - 112 

Q 

- 

8 


TABLE 5 


Frequency Distribution for 10,000 letters of telegraphic English 
as compiled by Hitt. 




A. Alphabetically 

arranged. 



A - 813 

H - 201 

L - 392 

Q - 38 

V - 136 


B - 149 

H - 386 

M - 273 

R - 677 

W - 166 


C - 306 

I - 711 

N - 718 

S - 656 

X - 51 


D - 417 

J - 42 

0 - 844 

T - 634 

Y - 203 


E -1319 

K - 88 

P - 243 

U - 321 

Z - 6 


F - 205 







B. Arranged according to frequency. 



E -1319 

S - 656 

D - 321 

F - 205 

K - 88 


0 t 844 

T - 634 

C - 306 

G - 201 

X - 51 


A - 813 

D - 417 

M - 273 

7/ - 166 

J - 42 


N - 718 

L - 392 

P - 243 

B - 149 

Q - 38 


I - 711 

H - 386 

Y - 208 

V - 136 

Z - 6 


ft t 677 






1- Ijitt, Capt. Parker. Manual for the Solution of Military Ciphers. 
Army Service Schools Press, *’ort Leavenworth, Kansas, 1916. 
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TABLE 7-A 

THE 438 DIFFERENT DIGRAPHS OF 
TABLE 6 ARRANGED ACCORDING TO 
THEIR ABSOLUTE FREQUENCIES. 


I 

* 


E N 

111 

0 R 

3 1 

&E 

1 a 

R R 

1 1 

R E 

9 8 

N 1 

3 0 

E F 

1 8 

U E 

1 1 

E R 

a 7 

R 1 

3 0 

N 0 

1 8 

F T 

1 1 

N T 

8 a 

E L 

29 

P R 

1 8 

S U 

1 1 

T H 

7 a. 

H T 

2 8 

A 1 

1 7 

Y F 

1 1 

0 N 

77 

L A 

2 8 

H K 

1 7 

Y S 

1 1 

1 N .. 

7 5 

R 0 

2 8 

P 0 

1 7 

Y 0 

1 0 

T E 

7 1 

T A 

2 8 2.V9S 

K [) 

1 7 

F E 

1 0 

A hi 

6 4 

L L 

27 

T R 

1 7 

1 F 

1 0 

0 R 

6 4 

A D 

2 7 

D 0 

1 6 

L Y 

1 0 

S T . 

iS 3 

0 1 

2 7 

D T 

15 

M 0 

1 0 

E D 

6 0 

E 1 

2 7 

1 X 

1 5 

S P 

1 0 

M E 

5 7 

1 R 

27 

9 U 

1 5 

Y E 

9 

V E 

5 7 

) T 

27 

S 0 

1 5 

F R 

9 

£ a. 

5 4 

N G 

27 

Y T 

15 

1 M 

9 

N 0 . 

i? 2 

M E 

2 6 

A C 

1 4 

L D 

9 

TO. 

5 o 

N A 

2 6 

A hi 

1 4 

M 1 

9 

r ...S E 

4 9jay^ a 

S H 

26 

C H 

1 4 

hi F 

9 

A T_ 

,4 7 

1 V 

3 5 

C I 

1 4 

R C 

9 

t i; 

.4 5 

fi F 

8 5 

E M 

1 4 

R H 

9 

AH, 

*4 4 

0 M 

2 5 

G E 

1 4 

R Y 

9 

e e r 

„4 3 

0 P 

2 5 

U S 

1 4 

D D 

8 

R T 

4 3 

IM S 

24 

P A 

1 4 

N N 

8 

A S 

4 1 

S A 

24 

P L 

1 3 

0 F 

8 

C 0. 

-4 1 

.1 L 

23 

R P 

1 3 

1 A 

8 

1 0 

4 1 

P E 

23 

S C 

1 3 

H U 

8 

T V 

.4 1 

1 C 

2 2 

W 1 

1 3 

L T 

'8 

F 0... 

.4 0 

W E 

2 2 

M M 

1 3 

H P 

8 

F l , 

39 

U iM 

2 1 

D S 

1 3 

0 C 

8 

R A 

39 

C A 

2 0 

A U 

1 3 

0 W 

8 

E T 

3 7 

E P 

2 0 

1 E 

13 , 

P T 

8 

0 u_ 

3 7 

.E'V 

2 0 

L 0 

... ..l_33Z^f 

U G 

8 

L E 

3 7 

G H 

2 0 

A P 

1 2 

A V 

7 

ivl A... 

3 6 

H A 

20 

D R 

1 2 

B Y 

7 

T W 

.3 6 

H E 

2 0 

e y 

1 2 

C 1 

7 

E A 

3 5 

H 0 

. 20 

A Y 

1 2 . 

E h 

7 

1 S 

3 5 

L 1 

3 0 

E 0 

1 2 

0 A 

7 

S f 

3 4 

S S 

19 

0 D 

1 2 

E W 

7 


3 3 

T T 

19 . 

S F 

3. 2 

E X 

7 

h L 

. 3 3 

1 G 

19 . 

U S 

1 2 

G A 

7 

. A L 

3 2 

, N C 

1 9 

U T 

1 2 

1 P 

7 

C E 

3 2 

0 L 

1 9 

.V 1 

1 2 

i\l U 

7 

D A' 

3 3 

.0 T 

19 

W A 

1 2 

0 V 

7 

F C 

,3 2 

T S 

1 9 

.F F 

1 1 

R G 

7 

R S 

3 1 

W u 

19 

.P P 

1 1 

R l\| 

7 


1 Thq 3-8 digraphs above this line compose of the total. 
,1 The 53 digraphs above this lino compose CO'o of the total. 
* thq 117 digraphs above this line compose 7-5*4 of the total, 
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TABLE 7 A CONCLUDED 


T F 7 
T N 7 
XT 7 
A B 6 
AG 6 
B L 6 
0 0 6 

Y A 6 
GO 6 
I D 6 
K E 6 
L S 6 
MB 6 
P I 6 
PS 6 
H F 6 
T C 6 
T D 6 
T M 6 
U L 6 

Y A 6 

Y N 6 

CL 5 
DM 5 
DP 5 
DU 5 
0 1 5 
U A 5 
U I 5 
FA 5 
G 1 5 

GK 5 
H F 5 
NL 5 
N M 5 
NY 5 
RL 5 
K U 5 
RV 5 
S D 5 
SR 5 
T L 5 
T U 5 
U M 5 
A F 4 
BA 4 
BO 4 
C K 4 
C R 4 
C U 4 


D B 4 
DC 4 

D N 4 

D W 4 

E B 4 

EG 4 

E Y 4 

G T 4 

H S 4 

M S 4 

N H 4 

NR 4 

OB 4 

PM 4 

R W 4 

S N 4 

S W 4 

W H 4 

Y C 4 

YU 4 

Y R 4 

PH 3 

P U .5 

R H 3 

SB 3 

S M ‘3 

T B 3 

U B 3 

U C 3 

U U 3 

Y P 3 

C C 3 

AW 3 

DL 3 

D V 3 

A A 3 

E U 3 

0 E 3 

Y I 3 

F S 3 

F U 3 

G N 3 

G S 3 

H C 3 

H N 3 

LB 3 

L C 3 

L F 3 

L P 3 

HC 3 


N P 3 
N V 3 
MW 3 
OH 3 
AH 3 
A K 3 
B I 3 
R R 3 
B U 3 
D G 3 
U H 3 
DO 3 
AO 3 
0 Y 3 
F C 3 
F L 3 
G C 3 
G F 3 
G L 3 
G P 3 
G U 3 
H D 3 
H M 3 
I B 3 
I K 3 
I Z 3 
J E 3 
JO 3 
J U 3 
K I 3 
L M 3 
L R 3 
L U 3 
L V 3 
L W 3 
M R 3 
M T 3 
M U 3 
M Y 3 
N B 3 
N K 3 
0 G 3 
OK 3 
P F 3 
R B 3 
S G 3 
S L 3 
TP 3 
UP 3 
W N 3 


X A 3 

X C 3 

X I 3 

X P 3 

Y B 3 

Y L 3 

Y hi 3 

Z E 3 

G G 1 

A J 1 

R J 1 

PM 1 

n H 1 

B T 1 

CD 1 

C F 1 

C M 1 

C N 1 

C S 1 

C W 1 

C Y 1 

D J 1 

n Y 1 

E J 1 

A E 1 

U 0 1 

YU 1 

E Z 1 

F 0 1 

F G 1 

F to 1 

F P 1 

F W 1 

F Y 1 

GO 1 

G J 1 

G M 1 

GW 1 

HR 1 

HL 1 

HP 1 

HQ 1 

H W 1 

H Y 1 

J A 1 

K A 1 

K C 1 

K L 1 

K N 1 

K S 1 


LG 1 
LH 1 
L N 1 
M D 1 
MF 1 
M H 1 
M J 1 
N Q 1 
0 J 1 
OX 1 
PR 1 
PC 1 
P 0 1 

PN 1 
P V 1 
P W 1 
P Y 1 
Q M 1 
tt R 1 
R J 1 
R K 1 
S K 1 
S V 1 
S Y 1 
T G 1 
TO 1 
T Z 1 
U F 1 
U V 1 
VO 1 

Y T 1 

w L 1 
W R 1 
W S 1 
W Y 1 
X D 1 
X E 1 
X F 1 
X H 1 
X N 1 
X 0 1 

X R 1 
X S 1 

Y G 1 

Y H 1 

Y W 1 
Z A 1 
Z i 1 


Total 5000 
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li, " TABLE 7-B 

■ THE 13 DIGRAPHS C0M"0SIN& 25'S OF THE DIGRAPHS 
IN TABLE NO. 6. ARRANGED ALPIUBETI^ALLY 
ACCORDING TO THEIR INITIAL LETTERS, 

*T? - 

LI- ’ (1) (?) 

AND ACCORDING TO THElR AND ACCORDING TO THEIR 

‘ -FINAL LETTERS. ABSOLUTE FREQUENCIES. 


AN 

64 

RE 

- 98 

AN 

64 

•.RE 

98 

Ed 

GO 

SE 

49 

EN 

111 

SE 

49 

£N 

111 

ST 

63 

ER 

87 

ST 

63 

|r 1 

87 



ED 

60 



ES 

‘ 64 

TE 

71 

ES 

54 

TH 

78 



TH 

78 



TE 

71 

IN 

, 75 

TO 

50 

IN 

75 

TO 

50 

ND ' 

'52 

VE 

57 

NT 

82 

VE 

57 

P 

NT 

57 

,_82 

Total 

1249 

NE 

ND 

57 

52 

Total 

1249 

CN 

77 



ON 

77 



OR 

64 



OR 

64 
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TABLE 7-C 


THE 03 DIGRAPHS COMPOSING 50^ OF THE 5000 DIGRAPHS OF TABLE 6, 
ARRANGED ALPHABETICALLY ACCORDING TO THEIR INITIAL LETTERS', 


( 1 ) ( 2 ) 

AND ACCORDING TO THEIR AND ACCORDING TO THEIR 

FINAX LETTERS ABSOLUTE FREQUENCIES 


A L 

3 2 

IM D 

5 2 

A ivt 

6 4 

IM T 

8 2 

A N 

6 4 

IM E 

5 7 

A T 

4 7 

iM E 

5 7 

A R 

4 4 

N 1 

3 0 

A R 

4 4 

iM 0 

5 2 

A 8 

4 1 

IM T 

8 2 

A 8 

4 1 

IM 1 

3 0 

A T 

4 7 



A L 

3 2 





0 N 

7 7 



0 IM 

77 

C £ 

3 2 

0 H 

to 4 

C 0 

4 1 

0 R 

6 4 

C 0 

4 1 

0 U 

3 7 

C E 

3 2 

0 U 

3 7 

D A 

3 2 

R A 

3 9 

0 E 

3 3 

R E 

9 8 

D E 

3 3 

R E 

9 8 

0 A 

3 2 

R T 

4 2 

E A 

35 

R 1 

3 0 



K A 

3 9 

E C 

32 

R 0 

2 8 

E N 

111 

r a 

3 1 

E D 

6 0 

R S 

3 1 

E R 

B 7 

R 1 

3 0 

E E 

4 2 

R T 

4 2 

E D 

6 0 

R 0 

2 8 

E L 

29 



E 8 

5 4 



E IM 

1 11 

a E 

4 9 

E E 

4 2 

S T 

6 3 

E R 

a 7 

a I 

3 4 

E T 

37 

S E 

4 9 

e a 

5 4 

a t 

to 3 

E A 

35 

3 1 

3 4 

E T 

3 7 



E C 

3 2 





T A 

2 8 

E L 

29 

T H 

78 

F 1 

3 9 

T E 

7 1 



T E 

7 1 

K 0 

4 0 

T H 

7 8 

F 0 

4 0 

T 0 

5 0 



T 1 

4 5 

F 1 

39 

T 1 

4 5 

H 1 

33 

TO 

5 0 

H 1 

3 3 

T Y 

4 1 

H T 

2a 

T W 

36 

H T 

28 

T W 

3 6 



T Y 

4 1 



T A 

2 8 

1 IM 

7 5 



1 IM 

7 5 



1 0 

4 1 

U R 

3 1 

1 0 

4 1 

U R 

3 1 

1 S 

35 



1 a 

35 



L A 

L E 

28 

3 7 

V E 

5 7 


37 

28 

V E 

5 7 

Total - 

2495 

L E 

L A 

Total - 

2495 

Li A 

36 



• 4 A 

3 6 
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j?ABLE 7-f> * r ^ 

v r£ffl5 117 DIGRAPHS COSsPOSlNG 751$ OF THE 5O00 DIGRAPHS OF 
*i TABLE 6, ARRANGED ALPHABETICALLY ACCORDING TO THEIR 

r • initial letters, 

" (1) AND ACCORDING T6 THEIR FINAL LETTERS. 


l 


A C 

14 

H A 

2 0 

P A ' 

1 4 

A D 

27 

H E 

2 0 

P E 

2 3 

A 1 

17 

H 1 

3 3 

P 0 

1 7 

A L 

3 2 

H 0 

2 0 

P R 

1 8 

A M 

1 4 

H R 

1 7 



A N 

6 4 

H’T 

2 8 

Q U 

1 5 

A R 

4 4 





A S 

4 1 

1 C 

22 

H A 

39 

A T 

4 7 

1 E ’ 

1 3 

K D 

1 7 

A II 

1 3 

l G 

19 

R E 

9 8 

i 


1 L 

23 

R 1 

3 0 

BE 

18 

1 74 

7 5 

K U 

2 8 



1 0 

4 1 

R 8 

3 1 

C A 

20 ' 

1 R 

27 

R T 

4 2 

C E 

32 

1 S 

3 5 



CH 

1 4 

1 T 

27 

S A 

24 

CO 

4 1 

1 V 

25 

S E 

4 9 

C T 

14 

1 X 

1 5 

S H 

26 

■■ 




S 1 

3 4 

D A 

32 

L A 

28 

S 0 

1 5 

DE 

33 

L E 

37 

S 8 

19 

D i 

27 

L 1 

2 0 

S T 

6 3 

D 0 

16 

L L 

27 



D S 

13 

L 0 

1 3 

T A 

2 8 

D T 

15 



- T E 

7 1 



\A A 

3 6 

T H 

7 8 

E A 

35 

M E 

26 

T 1 

4 5 

E C 

3 2 



T 0 

5 0 

E D 

6 0 

N A 

26 

T R 

17 

E E 

4 2 

N C 

1 9 

T vS 

19 

E F 

18 

N 0 

5 2 

T T 

19 

E 1 

27 

N E 

5 7 

T W 

3 6 

E L 

29 

N G 

27 

T Y 

4 1 

E M 

14 

N 1 

3 0 



E N 

111 

IM 0 

1 8 

U N 

2 1 

E P 

20 

N S 

24 

U R 

3 1 

E R 

8 7 

NT 

8 2 



E S 

5 4 



V E 

5 7 

ET 

3 7 

0 F 

2 5 



E V 

2 0 

0 L 

1 9 

W E 

22 



0 M 

2 5 

W 0 

19 

F 1 

39 

0 N 

77 



F 0 

4 0 

0 P 

2 5 

Y T 

15 

G E 

14 

Q R 

0 s 

6 4 

1 4 

Total - 

3745 

» G H 

2 0 

0 T 

19 






3 7 

» ' *- 

• - ■ ' t 



. ^ 

• T V 





~ - 

: -& 

■ : srf 



- J.OU — 

REF ID: A64644 
TABLE 7-u (ConcIuriPd) 

(2) AND ACCORDING TO THEIR ABSOLUTE FREQUENCIES. 


A N 

6 4 

H 1 

33 

P E 

2 3 

A T 

4 7 

H T 

2 8 

P R 

1 8 

A R 

4 4 

H A 

2 0 

P 0 

1 7 

A S 

4 1 

H E 

20 

P A 

1 4 

A L 

3 a 

H U 

2 0 



A n 

2 7 

ri K 

17 

y u 

1 5 

A 1 

17 





A C 

1 4 

1 N 

7 5 

H E 

9 8 

A M 

1 4 

1 0 

4 1 

K T 

4 2 

A U 

1 3 

1 S 

3 5 

R A 

39 



1 R 

27 

R S 

3 1 

BE 

1 B 

1 T 

27 

R 1 

3 0 



1 V 

2 5 

R 0 

28 

C 0 

4 1 

1 L 

23 

R 0 

1 7 

C E 

32 

1 C 

22' 



C A 

20 

1 G 

1 9 

S T 

6 3 

C H 

14 

1 X 

1 5 

S E 

49 

C T 

14 

1 E 

1 3 

S 1 

3 4 





S H 

2 6 

D E 

33 

L E 

37 

8 A 

2 4 

D A 

32 

L A 

28 

8 S 

19 

D 1 

2 7 

L L 

27 

S U 

15 

D 0 

16 

L 1 

20 



D T 

IS 

L 0 

1 3 

T H 

7 8 

0 S 

13 



T E 

7 1 



M A 

36 

T 0 

5 0 

E N 

111 

M E 

26 

T 1 

4 5 

E K 

8 7 



T Y 

4 1 

E D 

6 0 

N T 

8 2 

T W 

3 6 

t S 

5 4 

N E 

5 7 

T A 

2 8 

E E 

4 2 

N D 

5 2 

T 8 

19 

E T 

37 

N 1 

30 

T T 

19 

E A 

35 

H G 

27 

T R 

17 

E C 

3 a 

IM A 

26 



E L 

29 

0 0 

4 

U R 

3 1 

E 1 

2 7 

N C 

1 9 

U N 

2 1 

E P 

2 0 

N 0 

1 8 



E V 

20 



V E 

5 7 

E F 

1 8 

0 N 

7 7 



E M 

1 4 

0 K 

6 4 

W E 

22 



0 U 

37 

W 0 

19 

F 0 

4 0 

0 F 

2 5 



F 1 

39 

0 M 

25 

Y T 

1 5 



0 P 

2 5 



G H 

2 0 

0 L 

19 

Total - 

3745 

G E 

1 4 

0 T 

19 





0 S 

1 4 





TABLE 7-E 




ALL THE 438 DIGRAPHS OF TABLE 6, ARRANGED FIRST ALPHA¬ 
BETICALLY ACCORDING TO THEIR INITI/L LETTERS AND THEN ALPHA¬ 
BETICALLY ACCOPLING TO THEIR FINAL IETTERS. 

(SEE TABLE 6. READ ACROSS THE ROWS) 
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TABLE 8 

THE 438 DIFFERENT DIGRAPHS OF TABLE 6 ARRANGED FIRST 
ALPHABETICALLY ACCORDING TO THEIR INITIAL LETTERS, AND THEN 
ACCORDING TO THEIR ABSOLUTE FREQUENCIES UNDER EACH INITIAL 
LETTER. 


A i\| 

6 4 

C 0 

4 1 

A T 

47 

CE 

3 2 

A R 

4 4 

C A 

2 0 

A S 

4 1 

C H 

14 

A L 

32 

C T 

14 

A D 

27 

C 1 

7 

A 1 

1? 

C L 

5 

A C 

14 

C K 

4 

A ivl 

1 4 

C U 

4 

A LI 

1 3 

C U 

4 

A P 

1 2 

C C 

3 

A Y 

1 2 

C D 

1 

A V 

7 

C F 

1 

A B 

6 

C M 

1 

A G 

6 

C N 

1 

A F 

4 

C S 

1 

A A 

3 

C w 

1. 

A W 

3 

C Y 

1 

A H 

2 



A K 

2 

0 E 

3 3 

A 0 

2 

U A 

3 2 

A F. 

1 

l) 1 

2 7 

A J 

1 

l) 0 

16 



13 T 

15 

B E 

18 

U S 

1 3 

8 Y 

7 

U R 

12 

'B L 

6 

D D 

8 

8 A 

4 

U F 

8 

B 0 

4 

U M 

5 

B 1 

. 2 

13 P 

5 

BR 

2 

13 U 

5 

8 U 

2 

13 B 

4 

8 J 

1 

L) C 

4 

B wl 

1 

IJ N 

4 

B S 

1 

L) W 

4 

B T 

1 

0 L 

3 



13 V 

3 



U G 

2 



L) H . 

2 



13 Q 

2 



L) J 

1 



D Y 

1 


E R 

^8 7 

G H 

2 0 

E D 

6 0 

G E 

1 4 

E S 

5 4 

G A 

7 

E E 

4 2 

G 0 

6 

E T 

3 7 

G 1 

5 

E A 

35 

G R 

5 

E C 

3 2 

G T 

4 

E L 

29 

G IM 

3 

E 1 

2 7 

G S 

3 

E P 

2 0 

G C 

2 

E V 

2 0 

G F 

2 

E F 

1 8 

G L 

2 

E M 

14 

G P 

2 

E 0 

1 2 

G U 

P, 

E Q 

1 2 

G 0 

■ 1 

E N 

bill 

G G 

1 

E H 

7 

G J 

1 

E W 

7 

G 1.4 

1 

E X 

7 

G \i 

1 

E B 

4 



E G 

4 

H 1 

3 3 

E Y 

4 

H T 

2 8 

E U 

3 

H A 

2 0 

E J 

1 

H E 

2 0 

E Z 

1 

H 0 

2 0 



H K 

1 7 

F (J 

4 0 

H U 

8 

F 1 

39 

H F 

5 

F F 

1 1 

H S 

4 

F T 

1 1 

H C 

3 

F E 

1 0 

H N 

3 

F R 

9 

H D 

2 

F A 

5 

H M 

2 

F S 

3 

H B 

1 

F U 

3 

H L 

1 

F C 

2 

H P 

1 

F L 

2 

H Q 

1 

F D 

1 

H W 

1 

F G 

1 

H Y 

1 

F M 

1 



F P 

1 



F W 

1 



F Y 

1 
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TABLE 8 CONTINUED. 


1 14 

7 5 

L E 

3 7 

N T 

8 S3 

P E 

2 3 

1 0 

41 

L A 

2 8 

N E 

5 7 

P K 

18 

1 3 

35 

L L 

2 7 

N D 

5 2 

P 0 

1 7 

1 R 

27 

L 1 

2 0 

N 1 

3 0 

P A 

14 

1 T 

2 7 

L 0 

1 3 

N G 

2 7 

P L 

1 3 

1 V 

25 

L V 

1 0 

N A 

2 6 

P P 

1 1 

1 L 

23 

L 0 

9 

N S 

24 

P T 

8 

1 C 

22 

L T 

8 

N C 

19 

P 1 

6 

1 G 

19 

L 3 

6 

14 U 

1 8 

P 3 

6 

1 X 

15 

L ti 

3 

14 F 

9 

P M 

4 

I E 

1 3 

L C 

3 

14 N 

8 

P H 

3 

1 F 

1 0 

L F 

3 

14 U 

7 

P U 

3 

1 M 

9 

L P 

3 

14 L 

5 

P F 

2 

1 A 

8 

L U 

2 

14 M 

5 

P 8 

1 

1 P 

7 

L R 

2 

14 Y 

5 

P C 

1 

1 0 

6 

L U 

2 

14 H 

4 

P 0 

1 

1 U 

2 

L V 

2 

14 R 

4 

P 14 

1 

1 K 

2 

L IV 

2 

14 P 

3 

P V 

1 

1 Z 

2 

L G 

1 

i4 V 

3 

P 14 

1 



L H 

1 

H W 

3 

P Y 

1 

J E 

2 

L 14 

1 

14 B 

2 



J 0 

2 



14 K 

2 

(1 U 

1 5 

J U 

2 

H A 

3 6 

• N J 

1 

U M 

1 

J A 

1 

M E 

2 6 

14 R 

1 

a k 

1 



M M 

1 3 





K E 

6 

M 0 

1 0 

0 14 

7 7 

R E 

9 8 

K 1 

2 

h/f 1 

9 

0 R 

6 4 

R T 

4 2 

K A 

1 

M P 

8 

0 U 

3 7 

R A 

39 

K C 

1 

M B 

6 

0 F 

2 5 

R S 

3 1 

K L 

1 

t'A S 

4 

0 M 

2 5 

R 1 

3 0 

K in! 

1 

tvi C 

3 

0 P 

2 5 

. n o 

28 

K S 

1 

M R 

2 

0 L 

1 9 

R D 

1 7 



M T 

2 

0 T 

19 

R P 

1 3 



1.4 u 

2 

0 8 

1 4 

R R 

1 1 



m y 

2 

0 D 

1 2 

R C 

9 



wl D 

1 

0 C 

8 

R M 

9 



M F 

1 

0 w 

8 

R Y 

9 



M H 

1 

0 A 

7 

R G 

7 





0 V 

7 

R 14 

7 





0 0 

6 

R F 

6 





0 1 

5 

R L 

5 





0 B 

4 

R U 

5 





0 E 

3 

R V 

5 





0 H 

3 

R W 

4 





0 G 

3 

R H 

3 





0 K 

2 

R B 

2 





0 Y 

2 

R J 

1 





0 J 

1 

R K 

1 





0 X 

1 
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TABLE 8 CONCLUDED. 


8 T 

6 3 

U R 

3 1 

X T 

7 

S E 

4 9 

U N 

3 1 

X A 

2 

S 1 

3 4 

U S 

1 2 

X C 

2 

a h 

26 

U T 

1 2 

X 1 

2 

S A 

2 4 

U E 

1 1 

X P 

2 

.5 s 

19 

U G 

8 

X D 

1 

s 0 

15 

U L 

6 

X E 

1 

c 

1 3 

U A 

5 

X F 

1 


12 

U 1 

5 

X H 

1 

s u 

11 

U id 

5 

X N 

1 

S P 

1 0 

U B 

3 

X 0 

1 

a o 

5 

U C 

3 

X K 

1 

8 R 

5 

U D 

3 

X S 

1 

8 N 

4 

U P 

2 



-8 W 

4 

U F 

1 

Y T 

1 b 

S B 

3 

U 0 

1 

Y F 

1 1 

8 M 

3 

U V 

1 

Y S 

1 1 

8 G 

2 



Y U 

1 0 

, 8 L 

2 

V E 

5 7 

Y E 

9 

-S K 

1 

V 1 

12 

Y A 

6 

■8 V 

1 

V A 

‘6 

Y N 

6 

SY 

1 

V 0 

1 

Y C 

4 



V T 

1 

Y D 

4 

T H 

7 8 



Y R 

4 

• ,T E 

7 1 

W E 

22 

Y 1 

3 

T 0 

5 0 

WO 

19 

Y P 

3 

- T 1 

4 5 

‘W i 

1 3 

Y B 

2 

, T Y 

4 1 

W A 

12 

Y L 

2 

- T W 

36 

W H 

4 

Y M 

2 

- T A 

38 

W N 

2 

Y G 

1 

T S 

19 

W L 

1 

Y H 

1 

T T 

19 

W R 

1 

Y U 

1 

T R 

17 

W S 

1 

Y W 

1 

T F 

7 

W Y 

1 



T iJ 

7 



Z E 

2 

T C 

6 



Z A 

1 

T D 

6 



Z 1 

1 

T M 

6 





T L 

5 



Total - 

5000 

T U 

5 





T B 

3 





T P 

2 





T G 

1 





T fa! 

1 





T Z 

1 






(Note: For arrangement alphabetically first under initial 
letters and then under final letters, see Table 6.) 






R^f 2 ID: A64 644 

FABLE 9-A 

THE 438 DIFFERENT DIGRAPHS OF TABLE 6 ARRANGED FIRST 
ALPHABETICALLY ACCORDING TO Tffi'IR FINAL LETTERS AND THEN 
ACCORDING -TO THEIR ABSOLUTE FREQUENCIES. 


H A 

3 9 

E C 

3 2 

R E 

9 8 

W G 

2 7 

M A 

36 

1 C 

2 2 

TE 

7 1 

1 G 

19 

E A 

35 

N C 

19 

IM E 

5 7 

U G 

8 

D A 

3 2 

A C 

1 4 

V E 

5 7 

R G 

7 

L A 

2 8 

8 C 

13 

8 E 

4 9 

A G 

6 

T A 

28 

H C 

9 

EE 

4 2 

E G 

4 

N A 

26 

0 C 

8 

L E 

3 7 

D G 

2 

8 A 

24 

T C 

6 

D E 

3 3 

0 G 

2 

C A 

20 

u c 

4 

C E 

3 2 

S G 

2 

H A 

20 

Y C 

4 

-1 E 

S fa 

F G 

1 

P A 

1 4 

c c 

3 

P E 

2 3 

G G 

1 

W A 

12 

H C 

3 

W E 

2 2 

L G 

1 

1 A 

8 

L C 

3 

H E 

2 0 

T G 

1 

G A 

7 

yA C 

3 

B E 

1 8 

Y G 

i 

0 A 

7 

U C 

3 

G E 

1 4 



V A 

6 

F C 

2 

1 F. 

1 3 

T H 

7 8 

Y A 

6 

G C 

2 

U E 

1 1 

8 H 

26 

F A 

5 

X C 

2 

F E 

1 0 

G H 

2 0 

U A 

5 

K C 

1 

Y E 

9 

C H 

14 

B A 

4 

P C 

1 

K t 

6 

E H 

7 

A A 

3 



0 E 

3 

N H 

4 

X A 

2 

E D 

6 0 

J E 

2 

W H 

4 

J A 

1 

N 0 

5 2 

Z E 

2 

0 H 

3 

K A 

1 

A 0 

2 7 

A E 

1 

P H 

3 

Z A 

1 

R 0 

1 7 

X E 

1 

R H 

3 



0 D 

1 2 



A H 

2 

A B 

6 

L D 

9 

0 F 

2 5 

0 H 

2 

M B 

6 

0 D 

8 

E F 

1 8 

{. H 

1 

D B 

4 

1 D 

6 

S F 

1 2 

H H 

1 

E B 

4 

T D 

6 

F F 

1 1 

X H 

1 

0 B 

4 

8 0 

5 

Y F 

1 1 

Y H 

1 

L B 

3 

Y 0 

4 

1 F 

1 0 



S B 

3 

U D 

3 

N F 

9 



T B 

3 

H D 

2 

D F 

8 



U B 

3 

C D 

1 

T F 

7 



1 B 

2 

F D 

1 

R F 

6 



IM B 

2 

G D 

1 

H F 

5 



R B 

2 

M D 

1 

A F 

4 



Y B 

2 

PD 

1 

L F 

3 



H B 

1 

X L) 

1 

G F 

2 



P 8 

1 



P F 

8 







C F 

1 







M F 

1 







U F 

1 







X F 

1 
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TABLE 9-A CONTINUED. 


T i 

4 5 

A L 

3 2 

E'l 

39 

E L 

29 

V S'I 

3 4 

L L 

27 

*H 1 

3 3 

1 L 

23 

N 1 

3 0 

0 L 

19 

'R 1 

3 0 

PL 

1 3 

'D 1 

27 

B L 

6 

: E 1 

27 

U L 

6 

-L 1 

2 0 

C L 

5 

i‘A 1 

1 7 

IM L 

5 

■ W 1 

1 3 

R L 

5 

V 1 

12 

T L 

5 

h) 1 

9 

0 L 

'3 

t 

7 

F L 

2 

ip 1 

6 

G L 

2 

■ G l 

5 

S L 

2 

Q l 

5 

Y L 

2 

■ U 1 

5 

H L 

1 

Y 1 

3 

K L 

1 

■fB ! 

2 

W L 

1 

fc 1 

'2 



X 1 

2 

0 M 

2 5 

l i 

1 

A M 

14 



E M 

14 

A J 

1 

M M 

1 3 

B J 

1 

1 M 

9 

D J 

1 

R M 

9 

E J 

1 

T M 

6 

'G J 

1 

L) M 

5 

N J 

1 

N M 

5 

0 J 

1 

U M 

5 

R J 

1 

PM 

4 



S M 

3 

C K 

4 

H M 

2 

A K 

2 

L H 

2 

1 K 

2 

I 11 

Z 

N K 

2 

B Ivi 

1 

0 K 

2 

C M 

1 

R K 

1 

F M 

1 

S K 

1 

G l4 

1 



Q M 

1 


E N 

111 

0 P 

2 V 

0 N 

7 7 

E P 

2 0 

1 N 

7 5 

- R P 

1 3 

A N 

6 4 

A P 

1 2 

U N 

2 1 

P P 

1 1 

N N 

8 

S P 

10 

R N 

7 

M P 

8 

T N 

7 

1 P 

7 

Y N 

6 

D P 

5 

D W 

4 

L P 

3. 

8 N 

4 

N P 

3 

GN 

• 3 

Y P 

3 

H N 

3 

G P 

2 

W N 

2 

T P 

2 

C N 

1 

U P 

2 

K N 

1 

X P 

2 

L N 

1 

F P 

1 

P N 

1 

H P 

1 

X N 

1 





E 0 

1 2 

T 0 

5 0 

0 Q 

2 

C U 

4 1 

H Q 

1 

1 0 

4 1 

N Q 

1 

F 0 

4 0 

T Q 

1 

R 0 

2 8 



H 0 

2 0 



W 0 

19 



N 0 

18 



P (J 

1 7 



D 0 

1 6 



S 0 

15 



L 0 

1 3 



E 0 

1 2 



M 0 

1 0 



Y 0 

1 0 



G 0 

6 



0 0 

6 



B 0 

4 



A 0 

2 



J 0 

2 



U 0 

1 



V 0 

1 



X 0 

1 
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E R 

8 7 

N T 

0 R 

6 4 

S T 

A R 

4 4 

A T 

U R 

31 

R T 

1 R 

27 

E T 

P R 

1 8 

H T 

H R 

17 

1 T 

T R 

1 7 

0 T 

D R 

12 

T T 

R R 

11 

L) T 

F R 

y 

Y T 

G R 

5 

C T 

S R 

5 

U T 

C R 

4 

F T 

N R 

4 

L T 

Y R 

4 

PT 

B R 

2 

X T 

L R 

2 

G T 

Ivl R 

2 

M T 

Q R 

1 

B T 

W R 

1 

V T 

X R 

1 

0 U 

E S 

5 4 

u u 

A S 

4 1 

A U 

1 S 

3 b 

S U 

R S 

31 

H U 

N S 

2 4 

N U 

S S 

19 

D U 

T S 

19 

R U 

0 s 

1 4 

T U 

D S 

1 3 

C U 

U S 

1 2 

E U 

Y S 

11 

F U 

L S 

6 

P U 

P vS 

6 

B U 

H S 

4 

G U 

ivl S 

4 

J U 

F S 

3 

L U 

G S 

3 

ivl U 

B S 

1 

Y U 

c s 

1 


K S 

1 


W S 

1 


X S 

1 



TABLE 9-A CONCLUDED. 


8 2 

1 V 

6 3 

E V 

4 7 

A V 

4 2 

0 V 

3 7 

R V 

2 8 

D V 

2 7 

■ N V 

19 

L V 

19 

P V 

15 

S V 

15 

U V 

14 


1 2 

T W 

11 

0 V/ 

8 

E W 

8 

D W 

7 

R W 

4 

S W 

2 

A W 

1 

N W 

1 

L W 


C W 

3 7 

F W 

15 

G W 

13 

H W 

11 

P W 

8 

Y W 

7 


5 

1 X 

5 

EX 

5 

0 X 


4 

3 

3 

3 

2 

2 

2 

2 

2 

1 


2 5 T Y 4 1 

20 AY 12 

7 LY 10 

7 KY 9 

5 BY 7 

3 NY 5 

3 E Y 4 

2 HY 2 

1 0 Y 2 

1 C Y 1 

1 D Y 1 

F Y 1 

3 6 HY 1 

8 P Y 1 

7 BY 1 

4 W Y 1 

4 

4 I Z 2 

3 E Z 1 

3 7 1 1 

\ Total - 5000 
1 
1 
1 
1 
1 

15 

7 

1 
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TAELE 9-B 

THE 18 DIGRAPHS COMPOSING 25% OF THE 5000 DIGRAPHS 
V-o|' TABLE 6, ARRANGED ALPHABETICALLY ACCORDING TO" - THEIR 
„„ FINAL LETTERS, 

!. a) • (2) 

AND ACCORDING TO AND ACCORDING TO THEIR 

TifelR INITIAL LETTERS A3SOLUTE FREQUENCIES 



Fp 

60 


ED 

60 

sr\ 

NP 

52 


ND 

52 


w. 

57 


RE 

98 

£ 

M 

98 


' TE 

71 


m 

49 


NE 

57 


nb 

71 


' VE 

57 


■ -ff. 

57 


SE 

49 

i,. 


78 


TH 

78 


’ T - 






AN 

64 


EN 

111 

L* 

FN 

111 


ON 

77 


"IN 

75 


IN 

75 


. .PN 

77 


AN 

64 

- 

TO 

50 


TO 

50 



87 


- ER 

87 

;.r 

OR 

i ■ 

64 


OR 

64 

r. 

£s 

54 


ES 

54 


NT' 

82 


NT 

82 


~3T 

__63 


ST 

65 


Total - 

1249 


Total - 

1249 


RE^EV ID: A64 644 


TABLE 9-C 

THE 55 DIGRAPHS COMPOSING 50# OF THE 5000 DIGRAPHS 
OF TABLE 6, /.RANGED ALPHABETICALLY ACCORDING TO THEIR FINAL 
LETTERS, 

(1) AND ACCORDING TO THEIR (2) AND ACCORDING TO THEIR 

INITIAL LETTERS. ABSOLUTE FREQUENCIES. 


DA 

52 

CO 

41 

RA 

39 

TO 

50 

EA 

55 

FO 

40 

MA 

36 

CO 

a 

LA 

28 

10 

41 

EA 

35 

10 

41 

MA 

56 

NO 

28 

DA 

32 

FO 

40 

RA 

39 

TO 

50 

la 

28 

RO 

28 

TA 

26 

AR 

44 

TA 

28 

ER 

87 

EC 

52 

ER 

87 

EC 

32 

OR 

64 



OR 

64 



AR 

44 

ED 

60 

UP 

31 

ED 

60 

UR 

51 

ND 

52 

AS 

41 

ND 

52 

ES 

54 

CE 

32 

ES 

54 

RE 

98 

AS 

41 

DE 

33 

IS 

35 

TE 

71 

IS 

35 

EE 

42 

itS 

31 

NE 

57 

RS 

31 

LE 

37 



VE 

57 



NE 

. 57 

AT 

47 

SE 

49 

NT 

82 

PE 

98 

F.T 

37 

EE 

42 

ST 

63 

SE 

49 

HT 

28 

LE 

37 

AT 

47 

TE 

71 

NT 

32 

DE 

33 

RT 

42 

VE 

57 

RT 

42 

CE 

32 

ET 

37 



ST 

63 



HT 

28 

TII 

78 

OU 

37 

TH 

78 

OU 

57 

FI 

39 



TI 

45 



HI 

33 

TO 

36 

FI 

39 

TO 

36 

NI 

30 



SI 

34 



RI 

30 

TY 

41 

HI 

33 

TY 

41 

SI 

34 

Total - 

2495 

NI 

30 

Total - 

2495 

TI 

•15 



RI 

30 



AL 

32 



AL 

32 



EL 

29 



EL 

29 



AN 

64 



EN 

111 



EN 

111 



ON 

77 



IN 

75 



III 

75 



ON 

77 



AN 

6i 
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TABLE 9-D 

THE 117 DIGRAPHS COMPOSING OF THE 5000 DIGRAPHS OF 
TABLE 6, ARRANGED ALPHABETICALLY ACCORDING TO THEIR FINAL 
LETTERS, 

„ r , * 

(1) AND ACCORDING TO THF.IR INITIAL LEITERS. 


C A 

2 0 

E F 

18 

C 0 

4 1 

A T 

4 7 

0 A 

32 ‘ 

0 F 

2 5 

D 0 

1 6 

C T . 

1 4 

E a: 

3 5 



F 0 

4 0 

D T 

15 

H A 

2 0 

1 G 

19 

H 0 

2 0 

E T 

3 7 

L A ' 

28 

N G 

2 7 

1 0 

4 1 

H T 

28 

M A 

36 



L U 

1 3 

1 T 

27 

N.A 

2 6 

C H 

14 

N 0 

1 8 

N T 

8 2 

P, A 

1 4 

G H 

2 0 

P 0 

1 7 

0 T 

19 

H A 

39 

8 H 

2 6 

R 0 

2 8 

R T 

4 2 

S i\ 

2 4 

T H 

7 8 

S 0 

1 5 

S T 

6 3 

T A 

28 



T 0 

5 0 

T T 

19 

, 


AJ 

1 7 

W 0 

19 

V T 

1 5 

A C 

1 4 ' 

D 1 

2 7 





E C 

3 2 

E 1 

2 7 

E P 

2 0 

A U 

1 3 

1 c’ 

2 2' 

F 1 

3 9 

0 P 

2 5 

0 u 

37 

N, C, 

19 

H 1 

3 3 



Q U 

1 5 



L 1 

2 0 

A R 

4 4 



A. D 

27 

N 1 

3 0 

T R 

1'7 

E V 

2 0 

*E“d . 

6 0 

R 1 

3 0 

U R 

3 1 

1 V 

2 5 

w tr 

5 2 

S l 

34 

E R 

8 7 



R D 

17 

T 1 

4 5 

0 R 

6 4 

T H 

3 6 





P R 

1 8 



B £ 

1 8 

AL 

3 2 

H R 

1 7 

1 X 

1 5 

C E 

3 2 

E L 

29 

1 R 

2 7 



OE 

3 3 

1 L 

23 



T Y 

4 1 

EE 

4 2 

L L 

27 

A S 

4 1 

Total — 

3745 

GE 

1 4 

O' L 

19 

S S 

1 9 




2 0 



T S 

iy 



1 £ 

1 3 

A wl 

14 

0 S 

1 3 



IE 

3 7 

E M 

14 

E S 

5 4 



M £ 

2 6 

0 M 

25 

N S 

24 



N E 

57 



0 S 

1 4 



PE 

2 3 

A N 

6 4 

1 S 

35 



RE 

9 8 

E N 

ill 

R S 

3 1 



S t 

49 

1 N 

7 5 





T£ 

7 1 

0 N 

77 





■Yfc- 

5 7 

U N 

2 1 





W E 

22 
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TABLE 9-D Continued. 

(2) AND ACCORDING TO THEIR ABSOLUTE FREQUENCIES. 


H A 

39 

0 F 

25 

i-l A 

36 

E F 

i a 

E A 

35 



U A 

3 a 

N G 

2 7 

L A 

2 a 

1 G 

19 

T A 

a a 



N A 

26 

T H 

7 8 

S A 

2 4 

S H 

2 6 

C A 

20 

G H 

a 0 

H A 

2.0 

C H 

14 

P A 

1 4 

T 1 

4 5 

E C 

32 

F 1 

3 9 

1 C 

22 

S 1 

34 

N C 

19 

H I 

3 3 

A C 

14 

i-i 1 

3 0 



R 1 

3 0 

E D 

6 0 

U 1 

27 

N D 

5 2 

E 1 

2 7 

A 0 

27 

L 1 

2 0 

H D 

17 

A 1 

1 7 

R E 

9 8 

A L 

3 2 

TE 

7 1 

E L 

2 9 

N E 

57 

L L 

2 7 

V E 

57 

1 L 

2 3 

S E 

49 

0 L 

19 

E E 

4 2 



LE 

37 

0 \^ 

25 

0 E 

3 3 

A M 

14 

C E 

3 2 

' E H 

14 

M E 

26 



P E 

2 3 

E IM 

111 

W E 

2 2 

0 H 

77 

H E 

2 0 

1 N 

7 5 

B E 

1 8 

A N 

64 

G E 

1 4 

U N 

21 

1 E 

1 3 




T 0 

5 0 

N T 

8 2 

C 0 

4 1 

S T 

6 3 

1 0 

4 1 

A T 

4 7 

F 0 

4 0 

R T 

4 2 

R 0 

2 8 

E T 

3 7 

H 0 

2 0 

H T 

2 8 

W 0 

19 

1 T 

2 7 

N U 

1 8 

0 T 

19 

P 0 

1 7 

T T 

1 9 

D 0 

1 6 

D T 

1 5 

S 0 

1 5 

Y T 

1 5 

L 0 

1 3 

C T 

1 4 

0 P 

2 5 

0 u 

37 

E P 

2 0 

Q U 

1 5 



A U 

1 3 

E R 

8 7 



0 R 

6 4 

1 V 

2 5 

A R 

4 4 

E V 

2 0 

U R 

3 1 



1 R 

2 7 

T W 

36 

P R 

1 8 



H R 

1 7 

1 X 

1 5 

TR 

1 7 



E S 

A S 

5 4 

4 1 

T Y 

4 1 

Total - 

3745 

1 S 

35 



R S 

31 



IM S 

24 



8 S 

19 



T S 

19 



0 S 

1 4 



D S 

1 3 




TABLE 9-E 

AIiL THE 438 DIFFERENT DIGRAPHS OF TABLE 6 ARRANGED ALPHA¬ 
BETICALLY FIRST ACCORDING TO THEIR FINAL LETTERS AND THEN 
ACCORDING TO THEIR INITIAL LETTERS. 

(SEE TABLE 6. READ DOPN THE COLUMNS) 
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TABLE 10 A 


P 
: 1 


THE 56 TRIGRAPHS APPEALING 100 OR MORE TIMES 
THE '50,000 LETTERS OF 'GOVERNMENT PLAT N-TEXT TELEGRAMS 
ARRANGED ACCORDING TO THEIR ABSOLUTE FREQUENCIES 


&NT 569 FOU 

ION 260 ORT 


AND 

ING 


JVE 

TlO 

FOR 

'OUR 

'THI 


NIN 

STO 

EEN 


INE 

TEN 

EVE 

EST 

TEE 

TOP 

NTH 

TWE 

TWO 

ATI 

THE 

NTY 

TTRFi 


228 

226 

225 

221 

218 

211 

211 

210 

207 

202 

196 

196 

192 

190 

177 

176 

174 

174 

171 

170 

163 

160 

158 

157 

153 

153 


REE 

SIX 

ASH 

DAS 

IGH 

ERE 

COM 

ATE 

EIG 

FIV 

MEN 

SEV 

ERS 

UND 

NET 

PER 

STA 

TER 

EQU 

RED 

TED 

ERI 

HIR 

IRT 

DER 

DRE 


152 

146 

146 

146 

143 

140 

140 

138 

136 

135 

135 

135 

131 

131 

126 

125 

118 

115 

115 

115 

114 

113 

112 

109 

106 

105 

101 

100 


WEN 
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TABLE 10 B 


THE 56 TRIGRAPES APPEARING 100 OR MORE TIMES 
IN THE 50,000 LETTERS OF GOVERNMENT PLAIN-TEXT TELEGRAMS 
ARRANGED FIRST ALPHABET ICALLY ACCORDING TO THEIR INITIAL LETTERS 
AND THEN ACCORDING TO THEIR ABSOLUTE FREQUENCIES 


AND 

228 

ATI 

160 

ASH 

143 

ATE 

135 

COM 

136 

DAS 

140 

DER 

101 

DRE 

100 

ENT 

569 

EEN 

196 

EVE 

177 

EST 

176 

“ERTil 

138 

EIG 

135 

ERS 

126 

EQU 

114 

ERI 

109 

FOR 

218 

FOU 

152 

FIV 

135 

GHT 

196 

HRE 

153 

HIR 

106 

ION 

260 

ING 

226 

IVE 

225 

INE 

192 

IGH 

140 

IRT 

105 


MEN 

131 

NIN 

207 

NTH 

171 

NTY 

157 

NET 

118 

OUR 

211 

ONE 

210 

ORT 

146 

PER 

115 

REE 

146 

RED 

113 

STO 

202 

SIX 

146 

SEV 

131 

STA 

115 

TIO 

221 

THI 

211 

TEE 

174 

TOP 

174 

TWE 

170 

TWO 

163 

THR 

158 

TER 

115 

TED 

112 

UND 

125 

VEN 

190 

WEN 

153 
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TABLE 10 C 


THE'56 YRIGRAFHS APPEARING 100 OR MORE TIMES 
IN THE 50,000 LETTERS OP GOVERNMENT PLAIN-TEXT TELEGRAMS 
ARRANGED FIRST ALPHABETICALLY ACCORDING TO THEIR CENTRAL LETTERS 
* AND THEN ACCORDING TO THEIR ABSOLUTE FREQUENCIES 


DAS 

140 

ION 

260 



FOR 

218 

EEN 

196 

TOP 

174 

VEN 

190 

FOU 

152 

TEE 

174 

COM 

136 

TON 

153 



REE 

146 

EQU 

114 

MEN 

131 



SEV 

131 

HRE 

153 

NET 

118 

ORT 

146 

PER 

115 

ERE 

138 

TER 

115 

ERS 

126 

RED 

113 

ERI 

109 

TED 

112 

IRT 

105 

DER 

101 

DRE 

100 

IGH 

140 

EST 

176 


ASH 

143 

THI 

211 



GET 

196 

STO 

202 

THR 

158 

NTH 

171 


ATI 

160 

TIO 

221 

NTY 

157 

NIN 

207 

ATE 

135 

SIX 

146 

STA 

115 

EIG 

135 

OUR 

211 

FIT 

135 


HIR 

106 

IVE 

225 

ENT 

569 

EVE 

177 

AND 

228 



ING 

226 

TTO 

170 

ONE 

210 

TITO 

163 

INE 

192 


UND 

125 
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TABLE 10 D 


THE 56 TRIGRAPHS APPEASING 100 OS MORE TIMES 
IN THE 50,000 LETTERS OF GOVERNMENT PLAIN-TEXT TELEGRAMS 
ARRANGED FIRST ALPHABET I GALLY ACCORDING TO THEIR FINAL LETTERS 
AND THEN ACCORDING TO THEIR ABSOLUTE FREQUENCIES 


STA 

115 

TIO 

221 



STO 

202 

AND 

228 

TWO 

163 

UND 

125 



RED 

113 

TOP 

174 

TED 

112 

FOR 

218 

ITE 

225 

OUR 

211 

ONE 

210 

THR 

158 

USE 

192 

PER 

115 

EVE 

177 

TER 

115 

TEE 

174 

HIR 

106 

TWE 

170 

DER 

101 

HRE 

153 



REE 

146 

DAS 

140 

ERE 

138 

ERS 

126 

ATE 

135 

ENT 


DRE 

100 

569 


GHT 

196 

ING 

226 

EST 

176 

EIG 

135 

ORT 

146 


NET 

118 

NTH 

171 

IRT 

105 

ASH 

IGH 

143 

140 

FOU 

EQU 

152 

114 

THI 

211 

FIT 

135 

ATI 

160 

SET 

131 

ERI 

COM 

109 

136 

SIX 

146 

ION 

260 

NTY 

157 

KIN 

207 



een 

196 



TEN 

190 



WEN 

153 



MEN 

131 
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TABLE 11 A 

THE 54 TETRAGRAPHS APPEARING 50 OR MORE TIMES 
jan© 50 j000 LETTERS OF GOVERNMENT PLAIN-TEXT TELEGRAMS 
ARRANGED ACCORDING TO THEIR ABSOLUTE FREQUENCIES 


TION 

218 

OMMA 

71 

EVEN 

168 

LLAR 

71 

TEEN 

163 

OLLA 

70 

ENTY 

161 

VENT 

70 

STOP 

154 

DOLL 

68 

TRENT 

153 

LARS 

68 

me" 

153 

THIS 

68 

TYiiEN 

152 

PE5RI 

67 

THRE 

149 

ERIO 

66 

FOUR 

144 

ASET 

64 

IGHT 

140 

HUND 

64 

FIVE 

135 

DRED 

63 

HREE 

134 

RIOD 

63 

eIgS ' 

132 

IVED 

62 

DASH 

132 

ENTS 

62 

SEVE' 

121 

FPIC 

62 

ENTH 

114 

FROM 

59 

MENT 

111 

IRTY 

59 

THIR . 

104 

RTEE 

59 

EENT 

102 

UNDR 

59 

REQU 

98 

NAUG 

56 

HIRT 

97 

OURT 

56 

COMM 

93 

UGHT 

56 

QUES 

87 

STAT 

54 

UEST 

87 

AUGH 

52 

EQUE 

86 

CENT 

52 

NDRE 

77 

FICE 

50 


' t. 
l'- 
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TABLE 11 B 

THE 54 TETRAGRAPHS APPEARING 50 OR MORE TIMES 
IN THE 60,000 LETTERS OF GOVERNMENT PLAIN-TEXT TELEGRAMS 
ARRANGED FIRST ALPHABET ICALLY ACCORDING TO THEIR INITIAL LETTERS 
AND THEN ACCORDING TO THEIR ABSOLUTE FREQUENCIES 


ASHT 

64 

AUGH 

52 

COMM 

93 

CENT 

52 

DASH 

132 

DOLL 

68 

DEED 

63 

EVEN 

168 

ENTY 

161 

EIGH 

132 

ENTE 

114 

EENT 

102 

EQUE 

86 

ERIO 

66 

ENTS 

62 

FOUR 

144 

FIVE 

135 

FFIC 

62 

FROM 

59 

FICE 

50 

hree 

134 

HIRT 

97 

HOND 

64 

IGHT 

140 

IVED 

62 

IRTY 

59 

LLAR 

71 

LARS 

68 


MENT 

111 

NINE 

153 

NDRE 

.77 

NAUG 

56 

OMMA 

71 

OLLA 

70 

OURT 

56 

PERI 

67 

QUES 

87 

REQU 

98 

RIOD 

63 

RTEE 

59 

STOP 

154 

SEVE 

121 

STAT 

54 

TION 

218 

TEEN 

163 

THEN 

152 

THRE 

149 

THIR 

104 

THIS 

68 

UEST 

87 

UNDR 

59 

UGHT 

56 

VENT 

70 

VffiNT 

153 
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TABLE 11 C 

THE 54 IETRAGRAPHS APPEARING 50 OR MORE TIMES 
IN TRfc 50,000 LETTERS OP GOVERNMENT PLAIN-TEXT TELEGRAMS 
j&HRJJjGED FIRST ALPHABET I GAILY ACCORDING TO THEIR SECOND LETTERS 
' - AND THEN ACCORDING TO THEIR ABSOLUTE FREQUENCIES 


DASH 

132 

OMMA 

71 

LARS 

68 



NAUG 

56 

ENTY 

161 



ENTH 

114 

NpftE 

77 

ENTS 

62 



UNDR 

59 

TEEN 

163 



WENT 

153 

FOUR 

144 

SEVE 

121 

COMM 

93 

MENT 

111 

DOLL 

68 

EENT 

102 



REQU 

98 

EQUE 

86 

UEST 

87 

HREE 

134 

VENT 

70 

ERIO 

66 

PERI 

67 

DRED 

63 

CENT 

52 

FROM 

59 



IRTY 

59 

FFIC 

62 





ASHT 

64 

IGHT 

140 



UGHT 

56 

STOP 

154 



RTEE 

59 

THRE 

149 

SEAT 

54 

TEDi 

104 



THIS 

68 

QUES 

87 



HUND 

”64 

TION 

218 

OURT 

56 

NINE 

153 

AUGH 

52 

FIVE 

135 



EIGH 

132 

EVEN 

168 

HIRT 

97 

IVED 

62 

IlIOD 

63 



FICE 

50 

THEN 

152 

LLAR 

71 



OLLA 

70 
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TABLE 11 D 


THE 54 TETRAGRAPHS a-PPEARING 50 OR MORE TIMES 
IN THE 50,000 LETTERS OF GOVERNMENT PLAIN-TEXT TELEGRAMS 
ARRANGED FIRST ALPHABETICALLY ACCORDING TO THEIR THIRD LETTERS 
AND THEN ACCORDING TO THEIR ABSOLUTE FREQUENCIES 


LIAR 

71 

SLAT 

54 

FICE 

50 

UNDR 

59 

EVEN 

168 

TEEN 

163 

TWEN 

152 

HREE 

134 

QUES 

87 

DRED 

63 

IVED 

62 

RTEE 

59 

EIGH 

132 

AUGH 

52 

IGHT 

140 

ASHT 

64 

UGHT 

56 

THIR 

104 

THIS 

68 

ERIO 

66 

FFIC 

62 

OLLA 

70 

DOLL 

68 

comm 

93 

OMMA 

71 


YIENT 

153 

NINE 

153 

MENT 

111 

KENT 

102 

VENT 

70 

HUND 

64 

CENT 

52 

TION 

218 

STOP 

154 

RIOD 

63 

FROM 

59 

REQU 

98 

thre 

149 

HIRT 

97 

NDRE 

77 

TARS 

68 

PERI 

67 

OURT 

56 

DASH 

132 

UEST 

87 

ENTY 

161 

ENTH 

114 

ENTS 

62 

IRTY 

59 

FOUR 

144 

EQUE 

86 

NAUG 

56 

FIVE 

135 

SEVE 

121 
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HJ3LE 11 E 


THE 54 TETRAGRAPHS APPEARING 50 OR MORE TIMES 
Itf fllE 50,000 LETTERS OF GOVERNMENT PLAIN-TEXT TELEGRAMS 
ARRANGED FIRST ALPHABETIC&LLY ACCORDING TO THEIR FINAL LETTERS 
AND THEN ACCORDING TO THEIR ABSOLUTE FREQUENCIES 


atm 

71 

TION 

218 

OLLA 

70' 

EVEN 

168 

-- 


TEEN 

163 

FFIC 

62 

TWEN 

152 

HUND 

64 

ERIO 

66 

DRED 

63 



RIOD 

63 

STOP 

154 

IVED 

62 

FOUR 

144 

"NINE 

153 

THIR 

104 

HIRE 

149 

LLAR 

71 

' FIVE 

135 

UNDR 

*59 

HREE 

134 



‘'SEVE 

121 

QUES 

"87 

EQUE 

86 

THIS 

68 

NDRE 

77 

LARS 

68 

RTEE 

59 

ENTS 

62 

FICE 

50 

WENT 

153 

NAUG 

56 

IGHT 

140 



.MENT 

111 

DASH 

132 

EENT 

102 

EIGH 

132 

HIRT 

97 

ENHI 

114 

UEST 

87 

AUGII 

52 

YENT 

70 



. ASET 

. ' 64 

PERI 

67 

UGHT 

‘56 



OURT 

56 

DOLL 

68 

STAT 

54 



CENT 

52 

COMM 

93 



FROM 

59 

REQU 

98 



ENTY 

161 



IRTY 

59 
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TABLE 12 


AVERAGE AND MEAN LENGTHS OP WORDS 



No.of Letters 

No.of Times 

No. of 


In Word 

Word Appears Letters j 


1 

378 

378 


2 

973 

1946 


3 

1307 

3921 


4 

1635 

6540 


5 

1410 

7050 


6 

1143 

6858 


7 

1009 

7063 


8 

717 

5736 


9 

476 

4284 


10 

274 

2740 


11 

161 

1771 


12 

86 

1032 


13 

23 

299 


14 

23 

322 


15 

4 

60 


120 

9619 

50000 

(1) 

Mean Length 

50000 
“ "9619 

5.2 Letters 

(8) 

Average Length 

of messages . . . 

217 Letters 

(8) 

Mean Length of 

messages . 

191 Letters 

(4) 

Mode (Most frequent) Length . . . 

105 - 114 Letters 

(5) 

It is extremely unusual to find 5 
at least one vowel. 

consecutive letters without 

(6) 

The average number of letters between vowels is 2. 
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Page 
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Accented letters 

11 

6b 
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> 73 

36c 

" , deciphering 

64 

31c 

" , direct standard 

26,30,33 

12a,16,19 

" , enciphering 

60,64 

29b,31c 

" , keyword-mixed 

66 

31d 

" , mixed 

2.6,30,33, 

12a,15a,19,21d, 
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" , reversed standard 

26,30,40,44 
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11 -, standard 

26,30,33, 
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20b,23,38e 

*' , systematically mixed 

63 
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Analytic key for cryptanalysis 

13,125 
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Arbitrary symbols 

27,121,122 

13h,47d,48 
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113 

46h 
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23 

lib 

Ba.conian' cipher 

74 

35e 

Bar distribution 

16 

9a 

, Beginnings of messages' ' 

68 

32e 

Biliteral substitution 

85 

41 

Bipartite alphabet 

73 

35b, c 

Blanks, number of 

* 29 

14 e 

Book systems 

124 

49e 

Censorship, methods for evading 

121 

47c 

Characteristic frequency of 



the letters of a language 

17,28,51 

9d,34b,26 

Characteristic frequency of the letters 


of a language, suppression of 

77,37 

37,41f 

Checkerboard systems 

69,101 

44,43 

Checkerboards, 4-square 

, 39 

44 

Cipher,'Baconian 

74 

3.6e 

11 component 

70 

34 

” j distinguished from code 

79 

38c 

B text, length of as 



compared with plain text 

85 

40c 

" unit 

3.6 

41c 

Glassification of ciphers 

24,25,120,126 

12a,13,47,“>0e,f 

Code systems 

10,88,120,122 

4a,41g,47b,4« 

" " , distinguished from cipher 79,HI 

33c,24c,footnote 

Completing the plain component 

41,71 

20a,34a 

Concealed messages 

120 

47c 

Condensed table of repetitions 

.57 

27i 

Consonants distinguished from vowels 

57,66 

23,32c 

" , relative frequency of 

20,2.6,38 

10a,13,19 

” , in succession 

66 

32c 

Conversion of cipher text 

46,47,72 

21a,c,34c 

Coordinates on work sheet 

■62 

26d 

Coordination of services 

7 

2e 
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Page 

Paragraph 

Crests and troughs 

20 

28,37,92 

10a,14b,41f, A 4c 

" " " , absence of 


29 

14c 

Deciphering alphabets 


64 

31c 

Dictionary words used as code words 


120 

47b 

Digraphic substitution 


*6,83 

41c,42,43 

Digraphs, characteristic frequency 


51 

25 

" , weighted according 




to relative frequency 


60 

29 

Distribution, bar type 


16,23 

9a,lib 

" , normal 


32 

17b, c 

” , with no crests and troughs 


29 

14c 

Dummy letters 


121 

47c 

Elementary sounds, characteristic frequency 

28 

14b 

Enciphering alphabet 


64 

31c 

Endings of messages 


68 

32e 

Equivalent values 


31 

39b 

Figure ciphers 


27,122 

15h,43 

Fitting distribution to normal 


32,38,80 

17b,c,19,38e 

Foreign language cryptograms 


11,15 

5b,7c 

Formulas 


69 

33d 

frequency distribution 16,31, 

38 

,53,92,93 

9,17,19,26e,44c 

" n . i ~fitted to normal 


32 

17b, c 

" " , for code 


120 

47b 

" " , four part 


30 

38d 

" " , monoliteral 


16,31 

9,17 

» " , trigraphic 


54 

27 

" method of solution 


34,51,60 

18,24d,29 

General solutions in cryptanalysis 


119 

46 i 

” . system, determination of 10,12,25,125 

4a,6,13,50 

Generatrix 


14 

20a 

Goodness of fit 


52 

17b 

Grilles 


121 

47c 

Hidden messages 


121 

47c 

High frequency consonants 


26 

13d 

Historical examples of polyliteral systems 

75 

36 

Idiomorphism 


70 

33e 

Indicators 


123 

49b 

Intelligence facilities 


6 

2e 

Intelligible text obtained by chance 


47 

21b 

Intuitive method 


68 

33 

Invisible writing 


1 

11 

Japanese Morse alphabet 


11,122 

5b,43b 

Kata Kana Morse alphabet 


122 

48b 

Key phrase 


77 

36c 

Known sequences 


49 

23a 

Language employed in a cryptogram 


10 

4a, * 

" frequency characteristics 


17, ">1 

9d,25 

" • peculiarities 


11 

5b 
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Pago 

Paragraph 

Letters, accented 

11 

5b 

" , low frequency 

64 

31c 

” jj, missing 

11,29 

5b,14e 

Low-frequency consonants 

26,64 

13d,31c 

Medium frequency consonants 

26 

13d 1 

Messages, beginnings and endings 

amenable to cryptanalysis 

68 

32e 

" , general phraseology 

125 

49a 

" , hidden 

121 

47c 

Military text 

21 

10b 

Missing letters 

11,29 

5b,14e 

Mixed alphabet 

30,48,50,64 

15a,22b,24c,31b 

n sequence 

48 

21d 

Modified Playfair 

106 

46d 

Monoalphabets 

1 

lb 

Monoalphabet distinguished from polyalphabet 24,28 

12,14 

Monoliteral frequency distribution 

16,31 

9,17 

Morse alphabet, Japanese, Russian 

11,122 - - 

5b,48b 

Normal distribution 

32 

17b, c 

n frequency 

17,21,51 

9,11,25 

" " , deviations from 

26 

13b 

Nulls 

83,121 

40,47c 

New York Tribune 

75 

36 

Patterns 

69 

33d 

Pentaliteral cipher 

74 

35e 

Phraseology of messages 

123 

49a 

Plain component, completion of 

41 

20a 

Plain-text unit 

85 

41c 

Playfair cipher 

94,102 

44,46 

" , modified 

106 

46d 

Polyalphabotic cipher distinguished 
from monoalphabot 

24,28 

• 12,14 

Polygraphic substitution 

85 

41 

Polyliteral substd tution 

73,77,85 

35,37,41c 

11 systems, historical examples of 75 

36 

Prefixes in trigraphic distribution 

55 

27e 

Prerequisites for cryptographic work 

2 

2 

Probable-word method 

68 

33 

Pseudo-polygraphic systems 

87 

41e 

Punctuation in telegraphic text 

21 

10c 

Random text, number of blanks 

29 

14f 

Relative frequencies 

21,28 

10b,c,d,ll,14b 

Repetitions 

27,50,51,56 

13g,24b,24c,27 

" , in a code message 

79 

38c 

" ' , of consonants 

66 

32c 

" , of digraphs and trigraphs 

56 

27f 

" , condensed table of 

57 

27i 


< 
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Page 

Paragraph 

Reversed standard alphabets 

30,64 

16,20b 

Reversible digraphs indicated on worksheet 53 

26f 

Russian Morse alphabet 

Security of monoalphabct using 

11,122 

5b,48b 

standard alphabets 

49 

23 

Sequences, known 

49 

23a 

" , mixed 

48 

21d 

" , unknown 

49 

23a 

Solutions of a subjoctivc nature 

8 

3 

Specific key 

10,14,40,64 

4,7,19a,31b 

Standard alphabets 

30,44,80 

15a,16,20b,38e 

Subjective solutions 

8 

5 

Substitution, bi3iteral 

85 

41 

” , digraphic 

" , distinguished from 

86,88 

41c,42a 

transposition 

24,25 

12,13 

" , polygraphic 

85 

ilc 

" , polyliteral 

85 

41c 

" , trigraphic 

* 85 

41c 

" , triliteral 

85 

41 

Suffixes in trigraphic distribution 

55 

27e 

Suppression of frequency 

77,84,87 

37,40b,41f 

Symbols as cipher elements 

27,121,122 

15h,47d,48 

Telegrams, average length of 

23 

lib 

Terminology 

1 

1 

Text, different types 

Transposition distinguished from 

21 

10b, c 

substitution 

24,25 

12,13 

Trigraphic cipher system 

85 

41c 

" frequency tjble 

54 

27 

Triliteral frequency distribution 

54 

27 

" substitution 

85 

41 

Type numbers for cryptographic systems 

126 

50f 

Unknown sequences 

49 

23a 

Variants 

78,84,123,124 

37d,40b,49c,d,49f 

Vowels, average distance apart 

66 

32c, footnote 

" , combinations with consonants 

57,58 

28,29 

" , " ,T vowels 

59 

29a 

" , distinguished from consonants 

57,66 

28,32c 

" , in succession 

66 

32c 

" , relative frequency o? 

20,25,38 

10n,13,19 

Word formulas 

69 

33d 

11 lengths in a cryptogram 

52,68,69,70 

26c,32e,33d,33f,g 

" patterns 

69 

33d 

" skeletons 

61,68 

30b,32o 

Work sheet, preparation of 

51 

26 
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few 


Analytical JCey far Cryptanalysis (See Par. 50.) 
(Hmibers in parenthesis refer to Paragraph Nos. in this 


402-?olyalphabetic 
(To he taken 
_up l a t e r ) _ 


i 502-SystenatioaUy derlTed 
_ fiamjalaaia _ 

( 44 )|_ 


32-Mo no graphic 

(12)1 


30 3-Tr an spo si tion 
(To he taken up 
_later)_ __ 


304-Substitution 


I 405-ftonoalphabetic 
(35) 


(174, 22b)_!_, 

| 305-Conibined substitution- I 
I transposition I 

i_(To he_taken_uj>_Inter) | 


40i -Polyalphahetic 
(To he taten 
_ uu later) _ 


|503-Monoliteral | 
(41) X. 


1504-Polyliteral 

(37K 


1601-Sigraphic j - jg6g-Trl-graphic| |603-Etc-. | 604-Digraphie I 605-Trigraphicj | 60C-Etc. I 607-Ciphers oompoBed ex- 608-Ciphers composed of 

1 (44g) _ 1 (44g) 1 clusively of letters figures, symbols, or 

(44 - 46) (15 , 22 h, 23 , 24( intermixtures of letters, 

I _ I . figures aid symbols. 

__ J _ ..—_1_ _I - .. .. _1_ (Substitute letter equiv- 

701-Sliding alphabets 702-Checkerboard 703-Standard Cipher 704-Mixed Cipher alents and proceed as 

designs Alphabet Alphabet with Type 607.) _ 

(45 b) _ (47 -8) (17 - 22) (25 - 34) 

1---r 1 1 1 ~ ' * ;-f r 1 ~ --r 

|80l-4-alphabet 802-2-alphabet 803-1-alphabe-tj 804-Direcl 805-Reversec 1806-Systeraaticallyl 807-Random mixes 
| designs designs designs | mixed ) 

(48) (48) (Playfair) (19a,20a) (19b, 20b) j (4L 0 - e) 

1 _ _ - —I (49) _L____ |- 1 -) __ 

I 901-Keyword mixed! I 922-Other types 


610-?oly-equivalent 
substitution 
(57 . 40) _ 


705-Number Ciphers 706-Letter Ciphers 707-Number 708-Letter 
Ciphers Ciphers 

(36 h, c) (55a-,e, 36a) (SB) 


808-4-alphabet 

309-Rectangles 310-Book 

schemes with 

method 

numbers frco 


00-99 


(38 a - e) 

(38 f) (42 e) 
















